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We claim: 

1. A combinatorial library comprising variants of a parental ligand binding molecule, wherein 
said parental ligand binding molecule comprises an immunoglobulin Vh fragnient comprising 
at least in substantial part, at least the FR regions of the immunoglobulin Vh domain 
depicted in one of Figures 1 or 2 and wherein said variants comprise, at least in substantial 
part, at least the FR regions of the immunoglobulin Vh domain depicted in one of Figures 1 
or 2 and differ from said parental ligand binding molecule at amino acid residues 
constituting at least part of at least one of the CDRs of said parental ligand binding 
molecule. 

2. A library according to claim 1, wherein said parental ligand binding molecule is a 
substantially intact four chain antibody or a binding fragment thereof including an Fd 
fragment, an Fab fragment, an Fabc fragment, a F(ab')2 fragment, F(ab)2 fragment, a single 
chain V region fragment (scFv), or a fiision polypeptide, wherein the fusion polypeptide 
comprises any such parental ligand binding molecule fiised to another polypeptide. 

3. A library according to claim 1, wherein said parental ligand binding molecule is a dAb. 

4. A library according to claim 1, having a substantial representation of variants which have a 
CDRS that is 16 to 33 amino acids in length. 

5. A library according to claim 4, wherein substantially all of said variants have a CDR3 that is 
the same length. 

6. A library according to claim 4, wherein said variants have CDR3s which vary in length. 

7. (Amended) A library according to claim 5 [or 6], wherein a substantial proportion of said 
variants have a CDR3 that is 18 to 28 amino acids in length. 

76 



(Amended) A library according to claim 5 [or 6], wherein a substantial proportion of said 
variants have a CDR3 that is 20 to 25 amino acids in length. 

A library according to claim 5, wherein a substantial proportion of said variants have a 
CDR3 that is 23 amino acids in length. 

A library according to claim 4, wherein said variants vary from said parental ligand binding 
molecule in an amino acids constituting at least part of the CDR3. 

A library according to claim 10, wherein said parental ligand binding molecule comprises an 
immunoglobulin Vh binding fragment comprising, at least in substantial part, the CDR3 
region of the immunoglobulin Vh domain depicted in Figure 1 . 

A library according to claim 4, wherein said parental ligand binding molecule comprises an 
immunoglobulin Vh binding fragment comprising, at least in substantial part, the CDR 
regions of the immunoglobulin Vh domain depicted in Figure 1. 

A library according to claim 4, wherein said variants comprise the same FR regions as said 
parental binding molecule. 

A phage display library according to claim 4, wherein said parental ligand binding molecule 
comprises the entire FR regions of the immunoglobulin Vh domain depicted in one of 
Figures 1 and 2. 

A library according to claim 1, wherein said parental ligand binding molecule comprises at 
least in substantial part the FR2 region of the immunoglobulin Vh domain depicted in Figure 
1 , including residues 44, 45 and 47, and wherein the FR2 regions is at least partially 
randomized to generate variants having one or more hydrophilic amino acids at VH-VL 
interface. 
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A library according to claim 4, wherein said variants vary from said parental ligand binding 
molecule at amino acids which are proximal to the carboxy terminus of the CDR3. 

A library according to claim 12, wherein said variants vary from said parental ligand 
binding molecule in an amino acids which are immediately upstream of position lOOo. 

A library according to claim 4, wherein said variants vary from said parental ligand binding 
molecule in an amino acids lOOi to lOOn identified in SEQ. ED. NOS.: 1, 2 or 3. 

(Amended) A library according to claim 4, wherein said parental ligand binding molecule is 
derived from a human Vh domain identified in Figure 1 or is built on any framework which 
is at least 80% homologous [(preferably 85% homologous, more preferably at least 90% 
homologous)] to the framework and other conserved regions of said human Vh domain. 

(Amended) A library according to any claim 4 [or 19], wherein said parental ligand binding 
molecule is built on a Vh framework which is at least 80% homologous [(preferably 85% 
homologous, more preferably at least 90% homologous)] to the framework regions and 
conserved regions of a human Vh domain. 
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II. Priority 
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R Item II 
Priority 

1 _ There appears to be no basis for the subject-matter referred to in claims 1-37 in 
the priority documents. Thus, priority is considered invalid for these claims. 

2. Priority for claims 38-88 was found valid as patent application 60/163,546 filed at 
the USTPO on 04.1 1.1999 represents a basis for the subject-matter referred to in 
the said claims. 

Re Item HI 

Non-establishment of opinion with regard to novelty, inventive step and 
industrial applicability 

1 . The PCT requires that the subject-matter for which protection Is sought shall be 
defined by technical features (Rule 6.3(a) PCT). In claim 33, however, the 
products to be protected are only defined by the process for their production. This 
results in a lack of clarity to such an extent that no meaningful opinion can be 
fomned (Art 34(4)(a)(ii) PCT). 

2. Claims 75 and 76 define their subject-matter by the result to be achieved without 
providing relevant technical information (Rule 6.3(a) PCT). The claims in their 
present form lack therefore clarity to such an extent that no meaningful opinion 
can be formed (Art 34(4)(a)(ii) PCT). The dependent claims 77-83 do not provide 
technical features which overcome the said objection. Thus, these claims also 
lack clarity to such an extent that no meaningful opinion can be fomned (Art 
34(4)(a)(ii) PCT). 

Re Item V 

Reasoned statement under Rule 66.2(a)(ii) with regard to novelty, inventive st p 
or industrial applicability; citations and explanations supporting such statem nt 

1. Basis for the assessment of novelty, inventive step and industrial 
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applicability 

1 , 1 Reference is made to the following document: 

D1 : Y. REITER ET AL.: 'An antibody single-domain phage display library of a 
native heavy chain variable region: Isolation of functional single-domain VH 
molecules with a unique interface.' JOURNAL OF MOLECULAR BIOLOGY, 
vol. 290, no. 3, 16 July 1999 (1999-07-16). pages 685-698, XP002165613 
Oxford, GB cited in the application 

2. Novelty 

2.1 Document D1 discloses an antibody single-domain phage display library 

constructed on the basis of a native heavy chain variable (VH) region (D1 , Title). 
The VH domain scaffold originates from mouse and encompasses four frame 
work (FR) domains and the complementarity determining regions CDR-1 , -2 and - 
3 (D1, page 695, "Library construction"; page 689, Fig. la and lb). All frame work 
regions FR1-FR4 disclosed in D1 are partially identical to those referred to in Fig. 
1 of the present application (compare D1 , page 688, Fig. 1 with Fig. 1 of present 
patent application: FR1: aa2-aa8; FR2: aa38-aa39 FR3: aa85-aa88; FR4: aa103- 
aa1 11). The CDR-3 domain is randomised (D1 , page 695, "Library construction"; 
page 689, Fig. la and lb) The VH domain is cloned in absence of a light chain as 
a fusion protein to the phage minor coat protein encoded by gene 3 of the phage 
and electroporated into the microorganism E. coli to establish the library from 
which, in a subsequent step, the phage for panning are produced (D1, page 686, 
"Design and construction..."). For further characterisation of the binding domain, 
soluble VH protein is produced (D1 , page 689, "Characterisation of phage 
clones"). The randomised CDR-3 has a length of 9 amino acids (D1, page 688, 
Fig. 1a). 

Therefore, claims 1-3, 15, 29-32, 34, 38 and 84 lack novelty (Art 33(2) PCT). In 
conclusion, claims 4-14, 16-28, 35-37, 39-74, 85-88 are novel (Art 33(2) PCT) 

3. inventive step 

3.1 It appears that claims dependent claims 4-14,16-28, 35-37, 39-74 do not disclose 
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features which in combination with the features of the claims to which they refer 
fulfil the requirements of Art 33(3) PCT for inventive step. 

3.2 The method referred to in claim 85 appears to represent a cloning strategy 
whereby, starting from a parental DNA sequence, a "variable" region of said 
parental DNA sequence is replaced through the step-wise in vitro addition of 
"nucleic acids" which are selected from discrete pools. Claim 85 differs from the 
closest prior art document D1 in that randomisation of a section of DNA takes 
place through the addition of "nucleic acids" instead of single nucleotides as 
disclosed in D1 (D1 , page 695, "Library construction"). The technical effect is that 
the section of DNA can be biased towards a certain sequence coding for a 
particular amino acid. The technical problem is to provide an improved method of 
DNA randomisation. The solution is to use "nucleic acids" instead of single 
nucleotides for the construction of the randomised stretch of DNA. 
It appears that an inventive step (Art 33(3) PCT) can be acknowledged for said 
solution as none of the documents cited in the ISR disclose or indicate said 
solution. The dependent claims 86-88 are thus also inventive (Art 33(3) PCT). 



4. Industrial applicability 



4.1 The subject-matter disclosed in the claims 1-32, 34-74, 84-88 of the present . 
application appears to be industrially applicable (Art 33(4) PCT). 



Re Item VII 

Certain defects in the international application 

1 . The expression "herein incorporated by reference" or equivalents thereof (e.g. 
page 19, first paragraph etc.) in the description of the present application should 
have been deleted (Guidelines, Section IV, 11-4.17). 

2. The vague and imprecise statement in the description of the present application 
(page 68, second paragraph) implies that the subject-matter for which protection 
is sought may be different to that defined by the claims, thereby resulting in lack of 
clarity of the claims (Art. 6 PCT) when used to interpret them (Guidelines, Section 
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IV, lll-4.3a). The statement should therefore have been amended to remove this 
inconsistency. 

3. To meet the requirements of Art 5 and Rule 5 PCT, the document D1 should be 
identified in the description and the relevant background art disclosed therein 
should have been briefly discussed if the subject-matter for which this document is 
relevant prior art remains in the claims. 



Re Item VIII 

Certain observations on the international application 

1 . Claim 85 refers to a method of creating a library whereby by "a series of step-wise 
synthesis variant nucleic acids" are added to certain positions. These nucleic 
acids are selected from "discrete pools of nucleic acids". The description appears 
not to contain a basis for these features, resulting in a lack of support by the 
description (Art 6 PCT). 

2. The temns "substantial" and "partial" as used in claims 1.2.4, 25, 26 etc. result in 
a lack of clarity (Art 6 PCT). 

3. The abbreviations "dAb" (e.g. claim 3) and "FR" (e.g. claim 13) should have been 
defined in the claim to resolve a lack of clarity (Art 6 PCT). 

4. Abbreviations should be used with the same writing throughout the application 
(e.g. "Vh" in claim 14 but "VH" in claim 15). 

5. Claims shall not refer to figures unless where absolutely necessary (Rule 6.2(a) 
PCT). These references should have been replaced by the relevant sequence 
identification numbers (see page 15, second paragraph). 

6. The term "non-camelid type" in claim 38 lacks clarity (Art 6 PCT) as there is no 
definition of a "camelid type" in the claim. 

7. The terms "upstream" and "downstream" used, for example, in claim 38 result in a 
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lack of clarity and support by the description (Art 6 PCX). It appears that what is 
meant is the region FR1-FR4 according to the numbering by Kabat et al. (page 
15, second paragraph). 
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ENHANCED PHAGE DISPLAY LmRARIES OF HUMAN VH FRAGMENTS AND 

METHODS FOR PRODUCING SAME 

Field of the Invention 

The present invention relates to combinatorial libraries including phage display libraries 
vi^hich display binding fragments having preferred characteristics of solubility. The present 
invention also describes methods of producing phage libraries in which the phage population 
displays binding fragments having characteristics w^hich are biased towards characteristics of 
the wild type or parental binding fragment. 

Bacicground of the Invention 

Developments in antibody engineering and recombinant DNA technology haye made it 
possible to generate forms of recombinant antibody fragments which, in many ways, are 
functional substitutes of larger intact immunoglobulin molecules. Single heavy domain 
("dAb") antibody fragments have been the subject of several reports in the patent and 
scientific literature. The literature reports efforts to generate phage display libraries of such 
fragments for biopanning against a target ligand. 

U.S. Patent No. 5,702,892 C892) discloses a phage display library constructed in an M13 
derived expression vector, in which recombinant phage of the library contain a 
polynucleotide encoding a fusion protein which comprises a phage coat protein and an 
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immunoglobulin heavy chain binding-fragment. The heavy-chain binding-fragment spans 
from a position upstream of CDRl to a position downstream of CDR3. '892 describes that 
the DNA sequence encoding the CDR3 region and/or the CDRl region may be randomly 
varied so that the population of phage expresses a series of potential heavy chain binding 
domains for panning against the target ligand. 

U.S. Patent No. 5,759,808 discloses a phage display library comprising a population of phage 
based on random variation of a cDNA sequence obtained from lymphocytes of camelids 
previously immunized with target antigens. Camelid heavy chain antibodies occur naturally, 
in a composition of about 45%, as heavy chain dimers. Heavy chain antibodies specific for a 
target antigen may be generated by immunizing a member of the camelid species with the 
target antigen (see Lauwereys et al. (1998) The EMBO J. 17, 3512-3520). 

Hamers-Casterman et al. (1993) Nature 363, 446-448 report that camelid heavy chain 
antibodies are naturally more hydrophilic at amino acid residues at locations 44, 45 and 47 
(Kabat numbering system), in FR2, which corresponds to the surface where they normally 
contact the Vl domain. Another salient feature of a camelid Vh is that it generally has a 
comparatively longer CDR3 with a high incidence of cysteines and thus may form, via paired 
cysteines in CDRl and CDR3, exposed loops, which are more amenable to binding into 
cavities such as the active site of enzymes and antibodies (Desmyter et al. (1996) Nat. Stmct. 
Biol. Vol. 3, No. 9, p. 803). However, it has been questioned whether single domain 
antibodies with desired affinities can be generated with such configurations in the absence of 
prior immunization, i.e. with a naive library (Lauwereys et al. (1998) supra). 
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The present invention discloses advances in the technology related to creating libraries 
containing immunoglobulin-like proteins that specifically bind target Ugands eg. antigens. 

Summary of the Invention 

The invention is directed to a population of variants of at least one parental hgand-binding 
molecule, wherein said parental ligand-binding molecule coihprises an immunoglobulin Vh 
binding fragment comprising, at least in substantial part, at least the framework (FR) regions 
of the immunoglobulin Vh fragment depicted in one of Figures 1 or 2 and wherein said 
variants comprise at least in substantial part, the FR regions of the immunoglobulin Vh 
fragment depicted in one of Figures 1 or 2 and differ from said parental ligand-binding 
molecule in amino acid residues constituting at least part of at least one of the CDRs of said 
parental Ugand-binding molecule. Preferably said population of variants is constituted by one 
or more combinatorial Ubraries of such variants, for example, protein arrays, phage display 
libraries, ribosome display libraries etc. 

It is to be understood that the variants may (though not necessarily) form part of another 
structure or molecule, for example in the case of phage display, part of the coat protein of the 
phage. Accordingly, the term variant is used broadly to refer to variants of the essential 
molecule (a Ugand-binding molecule) when forming part of another structure or molecule 
(eg. as in phage display or ribosome display) or when independent of any such combination, 
eg. in tiie case of protein arrays whose members maynot be associated with individual 
supporting structiires/molecules. 
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In another aspect, the invention is directed to a ligand-binding molecule which has been 
identified as binding to a target ligand by screening a combinatorial library of the invention 
for one or more ligand-binding molecules which specifically recognize said target ligand. The 
invention is also directed more generally to any specific such ligand-binding molecule which 
is derived fi-om such combinatorial library of the invention. It is understood herein that such 
specific ligand-binding molecule may be directly obtained fi-om such a library or may be 
indirectly derived, for example, through the course of further antibody engineering or other 
modification steps (eg. creating fi-agments, derivatives, a secondary library, etc) using a 
ligand-binding molecule directly or indirectly obtained fi-om such library. It also imderstood 
that the invention excludes known ligand-binding molecules. In one embodiment of this 
aspect of the invention the target ligand is a cancer antigen. 

Figures 2, 3 and 4 depict preferred variations, more fiilly described below, on the preferred 
immunoglobulin Vr binding fi-agment and/or nucleic acid construct (depicted in Figiire 1. 
Figure 1 describes a wild-type parental immunoglobulin Vh binding fi-agment derived ft^om 
human monoclonal antibody BT32/A6 (hereinafter referred to as "A6") partially described in 
U.S Patent No. 5,639,863 (hereinafter referred to as the *863 patent). It has now been found 
that A6 has preferred solubility and other characteristics which lend themselves well to the 
creation of libraries, including naive libraries, of various types of human immunoglobulin 
fi-agments including scFvs, Fds, Fabs etc., as more fiilly described below. Accordingly, A6, 
and in particular, as more fiilly described below, at least a substantial part of the fi-amework 
(FR) regions of the A6 Vh fi^agment depicted in Figure 1, alone or in combination with 
features of its CDR3, provide a usefiil departure point, in the form of a parental ligand- 
binding molecule, for the randomization or partial randomization of amino acid residues 
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which tend to play a predominant role in ligand-binding, namely the CDR regions of the 
heavy chain and particularly the CDR3 of the heavy chain. As more fully described below, 
the nucleic acid changes (removal of the recombination site) relative to A6 wild-type Figure 
Ireflected in Figure 14 may be incorporated into Figures 1 or 2 to create preferred Figures 3 
and 4, respectively. 

The combinatorial library of the invention may be generated by phage display. Accordingly, 
in a preferred embodiment of one aspect of the invention, the invention is directed to a phage 
display library displaying a plurality of different variants of a parental ligand-binding 
molecule, wherein said parental ligand-binding molecule comprises an immunoglobulin Vh 
binding fragment comprising, at least in substantial part, at least the FR regions of the 
immunoglobulin Vh fragment depicted in one of Figures 1 or 2 and wherein said variants are 
encoded by nucleic acid sequences which vary from the nucleic acid sequence encoding said 
parental ligand-binding molecule in a sub-sequence (at least one) encoding at least part of 
one of the CDRs of said parental ligand-binding molecule, preferably the CDR3, whereby 
said plurality of variants comprise at least, in substantial part, the FR regions of the 
immunoglobulin Vh fragment depicted in one of Figures 1 or 2 and are differentiated, at least 
in part, by amino acid variations encoded by variations in said sub-sequence. 

In a preferred embodiment, in addition to substantial preservation and optional improvement 
of the FR regions of A6, the A6-based parental ligand molecule comprises (and therefore 
preserves within members of the library), in substantial part (subject to at least partial 
randomization of selected regions of one of the CDRs, preferably the CDR3, to create 
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binding diversity within the library), one or more of the CDR regions of the A6 Vh fragment, 
particularly the CDR3. In a further preferred embodiment, at least the length of the wild-type 
Vh CDR3 (23 amino acids) and preferably also elements of its amino acid composition, is 
preserved or at least partially preserved (approximately! 6-23 amino acids and more 
particularly 18 to 23 amino acids). Optionally the CDR3 may also be lengthened by 
approximately 1 to 10 residues. The Ubrary may optionally have representation of binding 
molecules having CDR3s of varying lengths. 

In a preferred aspect of the invention the parental ligand-binding molecule is a dAb fragment. 
It is known that a dAb molecule , due to the removal of its light chain partner, tends to, in 
most, if not all cases, aggregate, in varying degrees due to the "sticky" nature of the Vl 
interface. This stickyness is attributable, at least in part, to the hydrophobic nature of the Vh 
residues at this interface. This stickyness results in substantial dimer and/or multimer 
fomiation which may reduce, on the whole, the solubility characteristics of members of the 
library. Accordingly, in a further preferred aspect of the invention A6 Vh amino acid residues 
at the VL interface are substituted by residues which tend to minimize aggregate formation, 
for example, hydrophilic amino acids, and preferably one or more of the substitutions 
reflected in Figure 2, relative to Figure 1 . 

Altematively, in yet a further preferred embodiment of the invention, more fiiUy described 
below, such substitutions are not fixed within the entire population of the library, but are 
introduced by randomizing or partially randomizing various A6 Vh amino acid residues, 
particularly including FR residues, among the residues at the interface, (see for example. 
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Padlan et al "Anatomy of the Antibody Molecule** Molecular Immunology Vol. 31, pl69- 
217, Table 25 for itemization and related discussion of these residues). 

Altematively, in yet a further preferred embodiment of the invention, FR regions, other than, 
or in addition to, modifications to the Vl interface (FR2) may be modified by at least partial 
randomization, for example, one or both of FRl (one or more of residues 4 to 21) and FR4 
(one or more of residues lOOo to 1 13) to improve, on the whole, the solubility characteristics 
of members of the library (for example, biasing at least some and preferably all of one or both 
of these sets of residues (at least 70% or more), preferably 90% in favour of the parental 
amino acid constitution to achieve 10% randomization). 

In the case of A6 dAb firagments, it has been found that recombination events within the 
nucleic acid sequence encoding the Vh binding fi'agment tend to result in deletions yielding 
shorter molecules, with possibly comprimised binding characteristics. Thus, in a fiirther 
preferred aspect of the invention exemplified in Figures 3 and 4, and nucleic acid sequences 
which promote such recombination events (at putative recombination sites) are substituted, to 
oppose this tendency, preferably in a manner that does not result in an amino acid change. 
These changes may be incorporated into the wild-type A6 (see Figure 3) or improved 
variations thereof exemplified in Figure 2 (see Figure 4) which have a reduced tendency 
towards aggregation. 

Thus, in particularly preferred aspects, the present invention provides a heterogeneous 
population of genetic packages (eg. phage) having a genetically determined outer surface 
protein, wherein the genetic packages collectively display a plurality of different, preferably 
human, (ie. having substantial identity, preferably at least 80% homology to human 
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framework and other conserved regions) Vh ligand-binding fragments, each genetic 
package including a nucleic acid construct coding for a fusion protein which comprises at 
least a portion of the outer surface protein and a variant of at least one soluble parental 
ligand-binding fragment preferably derived from or having a substantial part of the FR 
regions of the amino acid sequence identified in one of Figures 1 or 2 (or a sequence at least 
80%, preferably 85 to 100%, more preferably 90-100%, homologous (% identity) thereto), 
wherein the variant Vh ligand-binding fragments preferably span from a position upstream of 
an immunoglobulin heavy chain CDRl to a position downstream of CDR3 (preferably 
including substantially all of FRl and/or FR4), and wherein at least part of a CDR, preferably 
the CDR3, is a randomly generated variant of a CDR of said parental Vh ligand binding- 
fragment and wherein the fiision protein is preferably expressed in the absence of an 
immunoglobulin light chain whereby the variant Vh ligand-binding fragments are, on the 
whole, better adapted to be or better capable of being expressed as soluble proteins. 

In yet another embodiment of the invention, by biasing the amino acid constitution, 
preferably on an individual amino acid by amino acid basis, in favor of the wild-type or 
parental amino acid constitution, even portions of the parental ligand-binding molecule that 
are randomized in favor of generating variability in the variant binding fragments can be 
engineered to maintain favorable solubility characteristics of the parental binding domain. 
Preferably, a portion of the construct encoding at least part of the CDR3 is biased or partially 
biased in favor of the parental amino acid constitution. 

In a further preferred embodiment, the parental Vh binding-fragment naturally has a long 
CDR3 that is amenable to forming exposed loops for binding into cavities. In a most 
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preferred embodiment, the parental Vh ligand-binding fragment is built on a human 
framework or is adapted from or adaptable to a human framework. 

In another preferred embodiment, the preferred binding region of the variants (corresponding 
to the randomized or partially randomized part of the CDR3) is located in carboxy terminal 
region oftheCDR3. 

In summary, according to the invention, a substantial part of the amino acid sequence 
identified in Figure 1, preferably including at least part of the CDR3, supplies the preferred 
amino acid constitution of the various preferred parental Ugand-binding molecules, such that 
a population of variant heavy chain ligand-binding molecules built on this framework of 
amino acids are on the whole better adapted to be or better capable of being expressed as 
soluble proteins. 

Brief Description of the Drawings 

The invention will now be described with reference to the drawings, wherein: 

Figure 1 is a sequence diagram showing a parental Vh ligand-binding molecule (A6) 
according to the invention. 

Figure 2 is a sequence diagram showing another parental Vh ligand-binding molecule (A6.1) 
according to the invention, additionally showing modified nucleic acid bases corresponding 



9 

SUBSTITUTE SHEET (RULE 26) 



wo 01/18058 



PCT/CAOO/01027 



to amino acids 24 and 25, for introducing the Nhel site. Introduction of this Nhel site does 
not alter the amino acid constitution of A6. 1 . 

Figure 3 is a sequence diagram showing the A6 Vh ligand-binding molecule (encoded by A6- 
chi(-)) according to the invention, in which the nucleic acid residues corresponding to amino 
acids 3 to 16 of A6 wild-type have been modified to remove a putative recombination site, 
leaving the amino acid constitution of A6 unchanged. 

Figure 4 is a sequence diagram showing the A6.1 Vh ligand-binding molecule (encoded by 
A6.1-chi (-)) according to the invention, in which nucleic acids corresponding to amino acids 
3 to 16 of A6.1 have been modified to remove a putative recombination site, leaving the 
amino acid constitution of A6.1 imchanged. The altered nucleic acid residues corresponding 
to Nhel are also shown. 

Figure 5 is a facsimile of an SDS-PAGE showing high expression of human A6. 1 dAb in E. 
CoU. 

Figure 6 shows size exclusion chromatograms of molecular weight markers (A), dAb (B) and 
A6.1 dAb obtained by gel filtration obtained using a Superdex 75 column. The masses of the 
markers were 2,000, 67, 43, 25 and 14 kDa. 

Figure 7 is a size exclusion chromatogram of A6.1 dAb following IMAC purification 
showing molecular weights associated with the peaks. 
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Figure 8 is an NMR 2-D spectra showing the molecular configuration of two embodiments of 
an A6.1 based dAb. In particular, the two 15N-1H HSQC spectra are: a: R3A10(Cys-), the 
spectrum was acquired at 308 K; b: M2R2-1, the spectrum was acquired at 298 K. 

Figure 9 is diagrammatic representation of the amino acid substitutions in parental ligand- 
binding molecule A6.1 and A6.1C relative to wild-type A6. 

Fisure 10 is a sequence diagram showing a parental Vh ligand-binding molecule designated 
A6.1C. 

Figure 1 1 is a graphic representation of the binding characteristics of A6.1C library binders to 
3B1 and control BSA. 

Figure 12 is a sensogram overlay showing the binding characteristics of a potential Vh 
binding fragment generated against anti-FLAG antibody (M2) using a phage display Ubrary 
of the invention. 

Figure 13 is a diagrammatic representation of vector, SJFI, used to create the vector into 
which the library is cloned. 

Figure 14 is a listing of the nucleotide and amino acid sequence of A6 Vh after introduction 
of the Nhel site and removal of the putative recombination site at amino acid residues 3 to 
16. 
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Figxire 15 is a schematic representation of steps taken to remove the putative recombination 
site of the 5 end of the A6 Vh gene. 

SEQ. ID. NO. 1 corresponds to the nucleic acid sequence shown in Figure 1. 

SEQ. ID. NO. 2 corresponds to the amino acid sequence shown in Figure 1 . 

SEQ. ID. NO. 3 corresponds to the amino acid sequence of CDRl shown in Figure 1. 

SEQ. ID. NO. 4 corresponds to the amino acid sequence of CDR2 shown in Figure 1. 

SEQ. ID. NO. 5 corresponds to the amino acid sequence of CDR3 shown in Figure 1. 

SEQ. ID. NOS. 6-11 and 25 correspond to the nucleotide sequences of primers disclosed 
herein. 

SEQ. ED. NOS. 12-24 correspond to the amino acid sequences of CDR3 variants disclosed 
herein at Table 2. 

Detailed Description of Preferred Embodiments 

In a preferred embodiment, the invention is directed to a population of genetic packages 
having a genetically determined outer surface protein including genetic packages which 
collectively display a plurality of different ligand-binding molecules in association with the 
outer surface protein, each package including a nucleic acid construct coding for a fusion 
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protein which is at least a portion of the outer surface protein and a variant of at least one 
soluble parental ligand-binding molecule derived from or having the amino acid sequence 
identified in Figure 1 (or a sequence preferably at least 80% homologous in the framework 
and conserved regions thereof), wherein at least part of the construct, preferably including at 
least part of the CDR3 identified in Figure 1, encodes or is biased in favor of encoding, the 
amino acid constitution of the parental ligand binding Augment such that the plurality of 
different ligand-binding domains are on the whole better adapted to be or better capable of 
being expressed as soluble proteins. The variant Vh ligand-binding molecules are preferably 
characterized by a CDR3 having 16 to 33 amino acids. 

Preferably, the replicable genetic package is a recombinant phage and the heterogeneous 
population of replicable genetic packages collectively constitute a phage display library. 

In a preferred embodiment, the parental ligand-binding molecule is a Vh binding fragment, 
and the plurality of variant ligand-binding fragments are expressed in the absence of light 
chains. In another embodiment, the parental ligand-binding-molecule is a natural occurring 
antibody or fragment thereof, having a natural human Vl interface. In another embodiment, 
the Vl interface is engineered to avoid hydrophobic amino acids. In another embodiment, the 
Vl interface is engineered for amino acids, which form weak interactions. In another 
embodiment the parental ligand binding molecule has a camelid type Vl interface. In another 
embodiment, at least one of the Vl interface amino acids are randomized or partially 
randomized in the constmction of the library. 
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Preferably the potential Vh binding fragments include the entire FRl through to FR4 regions, 
although it is to be understood that partial deletions, for example, within CDR2, are 
contemplated to be within the scope of the invention. 

Preferably, CDR3s of a variety of different lengths from 16 to 33 amino acids are 
predominantly represented among the potential Vh binding fragments. Preferably CDR3s of a 
variety of different lengths, from 18 to 28 amino acids, or from 20 to 25, or from 18 to 23, 
amino acids are predominantly represented in the library. In a preferred embodiment of the 
invention, the parental Vh ligand-binding fragment is built on a human framework and 
preferably is the parental Vh ligand-binding fragment identified in Figure 1 which has a 
CDR3 of 23 amino acids in length. 

The invention encompasses a phage display library which is constructed using a parental Vh 
ligand-binding molecule derived from a human parental Vh ligand-binding fragment, or is 
built on any framework which is at least 80% (preferably 85%, more preferably 90 to 95%) 
homologous to the framework and other conserved regions of a fully human Vh chain. The 
invention also contemplates that the parental Vh binding-fragment, though not human, is 
adapted (eg. humanized) or adaptable (eg. to be adapted after selection of preferred binders) 
to a human framework. 

In another embodiment, the invention also contemplates the random, biased or fixed 
occurrence of features disclosed in the camelid literature, for example pairable cysteines in 
CDRl and CDR3 (optional) and/or the substitution of hydrophilic amino acids at least one of 
positions 44, 45, and 47 and preferably also positions 93 and 94 (Kabat numbering system). 
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In a most preferred embodiment of the invention, the parental ligand-binding molecule is a 
Vh fragment derived from a human IgM heavy chain, and preferably comprises FRl through 
FR4 of the Vh chain. A partial sequence of the preferred antibody BT32/A6 (A6) is 
disclosed in U.S. Patent No. 5,639,863, incorporated herein by reference. The entire 
sequence is supplied now in Figure 1. 

In Figure 1, the CDR regions are demarcated. The amino acid residue numbers in Figure 1 
and throughout the disclosure refer to the Kabat numbering system (Kabat et al. 1991, 
Sequences of Proteins of Immimological Interest, publication No. 91-3242, U.S. Public 
Health Services, NIH, Bethesda MD) except in the sequence listings and where explicitly 
stated or otherwise implied. Figure 1 corresponds to SEQ. ID. NO. 1 (nucleic acid) and 
SEQ.ID. NO. 2 (amino acid). Figure 1 demarcates and labels regions CDRl (corresponding 
to SEQ. ID. NO. 3), CDR2 (SEQ. ID. NO. 4) and CDR3 (SEQ. ID. NO. 5). 

In addition to other types of antibody fragments (e.g. scFv, FAb, etc.) the A6 framework 
provides preferred solubility characteristics for creating dAb libraries. The term preferred 
solubility characteristics, as used herein, refers to at least one of the several, often correlated, 
characteristics including good yield, expression as a soluble product (as opposed to inclusion 
bodies) within the periplasm of the host organism, eg. Escherichia. Coli. and a reduced 
tendency to dimerize and other aggregate formation. 

The terms "polypeptide", "peptide" and "protein", unless the context implies otherwise., are 
used interchangeably herein, to refer to polymers of amino acid residues of any length. 
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The term "combinatorial library" is used herein to refer to a set of molecules, typically 
belonging to a defined (narrowly or broadly) class comprising a substantial number of 
potentially useful variants, wherein the variations in the molecule represent a complete or 
partial set of permutations or combinations of at least some constituent elements of a 
reference molecule, which is typically a template or "parental" molecule, or simply the class 
itself For clarity, in the case of polypeptides and nucleic acids, the constituent elements are 
amino acids and nucleic acid bases, respectively. 

As used herein, the phrase "in substantial part" refers to variations relative to a referenced 
molecule which do not significantly impair the "functionality" of that molecule. In the case 
of the parental ligand-binding molecule and variants thereof, functionality refers primarily to 
the solubility and binding characteristics of the molecule. Such variations (ie. the referenced 
molecule in substantial part) can be tested systematically to assess their impact. In the case 
of framework regions, in contrast to CDR regions, due to the substantial conservation of the 
fi^mework amino acid residues, a substantial part of the fi-amework would preferably refer to 
at least 80% identity of the amino acid residues and more preferably an 85 to 100% identity, 
and even more preferably at least a 90% identity of the amino acid residues. However, it is 
understood that each of the previous percentages could be relaxed to discount instances 
where the absence of identity in a given residue, is due to a well recognized conservative 
amino acid substitution, or where a particular class of functionality is noted, e.g. hydrophilic, 
if the substitution is with a residue of the same class. In the case of CDR residues, these 
numbers could be considerably even more relaxed. The term "in substantial part", in 
reference to portions of framework and CDR regions, also contemplates the possibility of 
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additions and deletions in those regions which do not impact the solubility and binding 
characteristics of the ligand-binding molecule in question. 

The term ligand-binding fragment is used broadly to define the whole or any part of an 
antibody that is capable of specifically binding to any ligand, in the broadest sense of the 
term ligand. 

An A6-based human heavy domain ligand-binding-fi-agment is well suited for the 
development of a combinatorial library (optionally a phage display library) that is used to 
generate soluble binding fiagments that are usefiil for human diagnosis and therapy (due to 
limited HAMA response). These phage display libraries are used to selectively generate 
molecular probes that specifically interact with a ligand, including without limitation, natural 
and synthetic molecules and macromolecules and can be used in vitro (i.e., a diagnostic) and 
in vivo (i.e., a diagnostic and/or therapeutic) as indicators, inhibitors and immunological 
agents. The types of natural and synthetic molecules and macromolecules include but are not 
limited to: antibodies and fragments thereof; enzymes; cell receptors; proteins, polypeptides, 
peptides; polynucleotides, oligonucleotides; carbohydrates such as polysaccharides, 
oligosaccharides, saccharides; lipids; organic-based and inorganic-based molecules such as 
antibiotics, steroids, homiones, pesticides, herbicides, dyes, polymers. 

As shown in Figure 5, a facsimile of an SDS-Page, A6.1 Vh has preferred solubility 
characteristics. This SDS-PAGE shows a particularly heavy band showing strong 
expression in E. Coli of an A6.1 dAb, designated R3A10. R3A10 was expressed as a 
soluble Vh in E. Coli. Yields as high as 55 mg/L of bacterial culture were obtained by IMAC 
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chromatography of periplasmic extracts. The single domain product was shown to be highly 
p!ire*and homogeneous by SDS-PAGE (Figure 5). Size exclusion chromatography on a 
Superdex 75 column gave a symmetric single peak at the expected elution position of a 
"nonomeric molecule with a molecular weight of 16 kDa, the molecular weight of Vh (Figure 
7). A preparation of R3A10 gave very high quality NMR data in the absence of detergent, 
confirming the absence of aggregated material (see Figure 8A). 

In general, the protein yields of many dAbs from the A6.1 library were above 5 mg per liter 
of bacterial culture in shaker flasks. Some had yields more than 10 mg and one over 50 mg. 
The solubility of the wild type and the camelized versions were very high as shown by NMR 
studies. R3A10 and M2R2-l(Cys') for example, were soluble in mM concentrations over 
extended periods of time allowing good quality NMR data collection. A NMR structure of a 
human VH camelized in this manner has been described (Reichmann, J Mol, Biol) but in 
order to reduce aggregation and achieve sufficient solubility CHAPS detergent had to be 
added to the sample during NMR data collection. By contrast the A6. 1 dAb molecules 
described here were completely firee of aggregated material in the absence of detergent. 

Conventional antibodies such as those found in human or murine species are 
composed of two identical light chains and two identical heavy chains. The combining sites 
of these antibodies are formed by association of the variable domains of both chains. This 
association is mediated through hydrophobic interactions at the interface. Stmctural and 
biochemical studies have shown that the heavy chain variable domain (Vh) provides most of 
the antigen-contacting residues (Padlan, 1994) (Chothia & Lesk, 1987) (Chothia, Novotny, et 
al., 1985). This finding has formed the basis for the development of single heavy domain 
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antibodies (dAbs) - recombinant antigen binding fragments consisting of only the Vh (Ward, 
Gussow, et aL, 1989) (Cai & Garen, 1996). However, in the absence of their Vl partners, VhS 
have been found to be insoluble, presumably because of the exposed hydrophobic Vl 
interface (Ward, Gussow, et al., 1989). Heavy chain antibodies, found in camelids (Hamers, 
Atarfaouch, et al., 1993) (Sheriff & Constantine,), lack light chains and as a result have 
variable domains that reflect the absence of a Vl partner. Single domain antibodies derived 
from these antibodies are highly soluble and the structural basis of solubility has been 
partially elucidated. First, conserved human/murine interface residues such as Val37, Gly44, 
Leu45 and Trp47 are generally replaced in heavy chain antibodies by tyrosine or 
phenylalanine, glutamate, arginine or cysteine, and glycine, respectively. These mutations 
increase the hydrophihcity of the Vl interface either by non-polar to polar substitutions or, in 
a more subtle way, by inducing local conformational changes (Desmyter, Transue, et al., 
1996) (Spinelli, Frenken, et al.,). This explanation is supported by experiments in which an 
insoluble human Vh was made soluble by introducing the aforementioned mutations at 
positions 44, 45 and 47 (Davies & Riechmann, 1994). Second, in the solved structures of two 
camel dAbs, the CDRSs fold back on the Vh surface, masking a significant surface area of 
the Vl interface (Desmyter, Transue, et al., 1996)(Decanniere, Desmyter, et al., 1999). 

Several other features of VhHs are noteworthy. One is the frequent occurrence of the 
cysteine residues in CDRl and CDR3 (Muyldermans, Atarhouch, et al., 1994) (Lauwereys, 
Arbabi, et al., 1998 (Vu, Ghahroudi, et al., 1997). While the location of the CDRl cysteine is 
typically fixed at position 33, that of the CDR3 cysteine varies. These two residues form a 
disulfide linkage between CDRl and CDR3 (Desmyter, Transue, et aL, 1996) (Davies & 
Riechmann, 1996). In the crystal stmcture of a dAb-lysozyme complex, the disulfide linkage 
imparts rigidity on the CDR3 loop which extends out of the combining site and penetrates 
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deep into the active site of lysozyme (Desmyter, Transue, et al., 1996). A second feature is 
the longer average length of the VhH CDR3, relative to human or murine Vhs (Muyldermans, 
Atarhouch, et al., 1994). A longer CDR3, which is a feature of A6, increases the antigen 
binding surface and, to some extent, compensates for the absence of the antigen binding 
surface provided by the Vl in conventional antibodies (Desmyter, Transue, et al., 1996). A 
third feature is the absence of the CDR3 salt linkage that is typically present in conventional 
antibodies and formed by arginine or lysine residues at position 94 and aspartate at position 
101 (Desmyter, Transue, et al., 1996) (Muyldermans, Atarhouch, et al., 1994) (Spinelli, 
Frenken, et al., 1996) (Davies & Riechmann, 1996) (Chothia & Lesk, 1987) (Morea, 
Tramontano, et al., 1998). 

As antigen binding fragments, dAbs are an attractive alternative to scFvs because of 
their much smaller size and the fact that they demonstrate affinities comparable to those 
demonstrated by scFvs (Ward, Gussow, et al., 1989) (Spinelli, Frenken, et al., 1996) 
(Lauwereys, Arbabi, et al., 1998) (Davies & Riechmann, 1995) (Arbabi, Desmyter, et al., 
1997) (Reiter, Schuck, et al., 1999). Smaller size is an advantage in applications requiring 
tissue penetration and rapid blood clearance. Smaller molecules also offer a tremendous 
advantage in terms of structural studies (Davies & Riechmann, 1994) (Constantine, Goldfarb, 
et al., 1992 (Constantine, Goldfarb, et al., 1993). 

Phage antibody library construction is much simpler and more efficient if single 
domain antibodies are used instead of Fabs or single chain Fvs. Randomization can be 
introduced at a much higher percentage of CDR positions without exceeding practical library 
size. The problem of shuffling original Vl-Vh pairings is also avoided. Camelid phage dAb 
libraries constmcted from the VhH repertoire of camels immunized with target antigens have 
performed well (Arbabi, Desmyter, et al., 1997) (Lauwereys, Arbabi, et al., 1998) 
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(Decanniere, Desmyter, et al., 1999). However, construction of libraries from immunized 
camels presents obvious problems. In addition, the non-human nature of products from these . 
libraries limits their usefulness. Synthetic dAb libraries (Davies & Riechmann, 1995) 
(Reiter, Schuck, et aL, 1999), particularly those based on a human Vh framework, alleviate 
these problems. 

Thus according to another embodiment of the invention, the parental ligand-binding fragment 
has amino acid substitutions at Vl interface which reduce the tendency to aggregation 
attributable to the "stickyness" of the Vh dAb at this interface. In another preferred 
embodiment of the invention, the parental ligand-binding fragment has a long CDR3 similar 
to some camelid antibodies. As discussed above, according to another embodiment of the 
invention, an A6 dAb based library is preferred, because A6 has an unusually long CDR3 of 
23 amino acids. In a particularly preferred embodiment, the library preserves the entire 
length of this CDR3 and at least one of positions 44, 45 and 47 are altered, preferably 44 or 
45 to camelid type residues. In the embodiment exemplified in examples 5 and 6, the CDR3 
was randomized and cysteine residues were introduced at positions 33 and lOOe in the 
expectation that the residues would form the CDR1-CDR3 disulphide bridge present in the 
camel antibody Cab-Lys3 (Desmyter, Transue, et al. 1996). The library was evaluated by 
panning against an IgG that binds a peptide of known sequence. Procedures for the 
construction and testing of this library is described in examples 5 through 13 and below as 
follows. 

A6 dAb is expressed surprisingly well as a soluble product in E. coli with a yield of 
approximately 10 mg per liter of bacterial culture . Mass spectrometry has confirmed that the 
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product has the expected molecular weight. As shown in Figure 6, size exclusion 
chromatography of the product reveals three components that are thought to correspond to 
monomer, dimer and higher oligomer on the basis of their elution volumes (Figure 6A and 
B). However, the monomer peak elutes unusually late suggesting that the dAb is interacting 
non-specifically with the gel matrix possibly through its exposed hydrophobic Vl interface. 
This is a property of human and murine-derived dAb that is not unusual and which has been 
documented previously (Ward, Gussow, et al., 1989) (Davies & Riechmann, 1994). By 
introducing the Gly44Glu, Leu45Arg, and Tyr47Gly mutations (Davies & Riechmann, 1994) 
into the A6 framework, a product is obtained that is exclusively monomer and which elutes at 
the expected volume in size exclusion chromatograms (Figure 6C). 

To generate the template for library construction, the dAb was further modified by 
introducing Val93Ala and Lys94Ala mutations in FR3 and an Ala33Cys mutation in CDRl . 
A preferred library having the itemized substitutions at positions 44, 45, 47 93 and 94 is 
referred to as "A6.r' Libraries with Ala 33 Cys and Cys at position lOOe (see discussion 
below) are temied "A6.1C". These substitutions are diagrammatically illustrated in Figure 9 
and the sequences for A6.1 and A6.1C are shown in Figures 2 and 10, respectively. In 
camelid VhHs, positions 93 and 94 are predominantly occupied by Ala residues and Cys is 
frequently found at position 33 (Muyldermans, Atarhouch, et al., 1994) (Vu, Ghahroudi, et 
al., 1997). The library was constmcted by randomizing 19 amino acids in CDR3, leaving the 
last three residues, PhelOO, AsplOl, and Tyrl02, unchanged (Davies & Riechmann, 1995). 
The degenerate oligonucleotide used for randomization was designed so as to always 
introduce Cys at position lOOe. This was done to facilitate the formation of intra-molecular 
disulfide linkage between lOOeCys and 33Cys in CDR3 and CDRl, respectively 
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(Muyldermans, Atarhouch, et al., 1994) (Desmyter, Transue, et al., 1996) (Davies & 
Riechmann, 1996). 

The library was initially placed in a phagemid vector. Following transformation the 
size of the library was determined to be 2.1 x 10^. Of 80 randomly picked clones analyzed by 
PGR all but one had the dAb insert. In addition, all twenty that were sequenced were unique, 
demonstrating the diversity of the library. To convert the display format from monovalent to 
multivalent, the library was sub-cloned into a phage vector (MacKenzie and To, (1998). 
Following transfomiation, the size of the library was determined to be 6.6 x 10^. Therefore, 
on a random basis each member of the original phagemid library is represented 3 times. 

Initially, the library was panned, in both formats, against 3B1 scFv, which is specific for a 
bacterial carbohydrate (Deng, MacKenzie, et al., 1994). With the phagemid vector format, 
panning failed to enrich for binders and PCR analysis of clones selected at different stages of 
the panning process revealed almost universal deletion of the dAb inserts. However, with the 
phage vector seven different dAbs that bound to 3B1 were identified. As shown in Figure 11, 
these dAbs bound to the target antigen, 3B1, in ELISA experiments and showed no detectable 
binding to the control BSA. In each instance, the consensus sequence was present at the 
extreme C-terminal end of CDR3 (see Table 1 and Table 2 below). 
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Table 1. The CDR3 sequences of dAbs isolated by panning against 3B1 scFv. The 
consensus sequence is shown in bold. 



dAb 


CDR3 sequence 


3B1R3-1 


VGPITGGAPRAVCKHAKAWPLPFDI 


3B1R2-3 


SSQPRVTSSPCVASKSWFLPFDI 


3B1R2-2 


PTTGI RGEKDCTPKKMWRLPFDI 


3B1R2-4 


RDPSVTDTGCCTPRWQAWLPFD I 


3B1R3-3 


PGEPPEASAPCIiRHRVGWLPFDI 


3B1R3-15 


KTVKMRDDEVCTKRTNWLLPFDI 


3B1R3-19 


PGNVASQQNLCGLRATRWLPFDI 



In the phage vector format, the library was also panned against M2 IgG, an antibody 
which was raised against the FLAG peptide DYKDDDDK (Knappik & Pluckthun, 1994). 
More recent studies showed that M2 recognizes the consensus sequence XYKXXD and 
prefers epitopes with aspartate at the first position (Miceli, Degraaf, et al., 1994). Twenty- 
four different dAbs with the FLAG consensus sequence were identified the sequencing of 
clones randomly selected after 3 rounds of panning (Table 2 below). 

Table 2. The CDR3 sequences of dAbs isolated by panning against M2 IgG at different DTT 
concentrations. The FLAG consensus sequence is shown in bold. 
A. No DTT 

VQYGKHRRGSCIEVHPEYKDPDI^ C. 1 xnM DTT 
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NPPKPGAQARCVTTVKDYKEFD I ^ 


TAEPALSPQACMTKERQYKDFDI 


AAIQTETARWCDRHPVSYKMFDI ^ 


QTETQPLYNDC I LRQAGYKWFD I 


QTETQPLYNDC I LRQAGYKWFD I 


AAIQTETARWCDRHPVSYKMFDI 


MHTLQHYRNLCSYQLADYKHFDI^ 


AADPRALMKSCALVTSDYKWFDI 


GLSGSRPNEQCDYKTGDHVQFDI ^ 


LRGRMRQQS CCGGAGNTYKDFD I 


LSGQNYTKTRCLVMQNDYKMFDI^ 


NPPKPGAQARCVTTVKDYKEFDI 


TAEPALSPQACMTKERQYKDFDI^ 


PGPGAPGEYKCEDWSNRQLS FD I 


E rYMYTRGKYCRALSADYKLFDI ^ 


GGLKNQD YKRCDTEGSGFTRFD I 


ESKASRTADQCSGPTPGYKNFDI ^ 




GSQAI KNLSECLVRSDDYKKPDI 


D«10 niM DTT 


GRYFQS KI TSCENNDRDYKLFD I ^ 


DRGPQGAPDPCLQI INDYKTFDI 


PRPARTGHKTCFVRPKNYKDFDI 


YMPAASPVSQCLATLI EYKAFDI 


AEAHSQLP PRCRRKTDE YKI FD I 


KHEVNHVEDRCNQTTETYKMFDI 


SHKTSQPVRNCSATDNSYKLFDI 


LQNS PKNSGWCD F I LAGYKAFD I 


TMGTLHS PHECMKSLVTYKNFD I 


MEPNRSYRGLCLEAPNEYKWFDI 


GRYFQSKI TSCENNDRDYKLFD I 


GDKQS PKSRRCLTWLVGYKHFD I 


ELGWRPRVQACHYSRNDYKYFDI 


PEQRQTVGHVCLTRMPDYKHFD I 


KDVTRTNTVSCSKDRQDYKMFDI 


QHTWARQENGCFMVDYKFSDFD I 


YSATAKWRDKCYEKSRDYKMFDI 




YE I VPF I ASRCVI ERADYKIiFD I 


E.lOO niM DTT 


ADAPNRQKERCWAVHGYKRFDI 


EAESRTWYAPCHSTRTDYKLFDI 


NEEKFSVYSECELYLPTYKMFDI 


TAEPALS PQACMTKERQYKDFD I 


IWEGEKHYAECVTGTKYQPDFDI 


KDVTRTNTVSCSKDRQDYKMFDI 




NWDAKDS PRKCSLMLTMYKDFD I 
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B. 1 /M DTT 


GRT.NHPSOTTCLVSEKEYKSFDI 


NP PKPGAQARCVTT V lUJ x ajs jfux 


GAQYRRLTSSCKPRSHEYKEFDI 


ETPRDTKLiTAClvr WFo-LiXivxr 


RNGNLT YKASC S S AGDD YRDFD I 


7^1 QTETARWCDKrllr V o X ivrar^ x 


CDQTYC S KWLCREEQ VD YKLFD I 


a 17 o li T . Q Pn PMTKERO YKDFD I 


PNERI KEGANCSMGTTEYKQFDI 


T cnnNVTKTRPT.VMONDYKMPDI 


PRSLSTSNATCTTSDYKRHDFDI 


TOT 1DODT.P"PPTPTfiPPRDYKMPDI 




o V IrKX J.UX\^X V^s<xijHOi^iXN-s*« 




TIP A T ."KmTMPP fSPRMS YKWPD I 




MWTT /^WYPMTjPSYOLiADYKHFDI 

£*Ln X XJS^n X r%X>l XJ\m» w x Se J-i*^'^ * *"^*** *^ * 




T . pn P T PMHMLiCNYO S VDYKTFD I 








RANEYGSKSRCTEGMYEYKSFDI 




GAMPQGASRMCAADQREYKAFD I 





^M2R2-l; '^2R2-2; =M2R2-4; '»M2R2-5; *M2R2-9; ^M2R2-10; 9M2R2-13; 
''M2R2-14; ^M2R2-15; 'M2R2-18; ^M2R3-4; ^M2R3-13. 



No consensus sequence other than XYKXXD could be identified. Interestingly, like the 3B1 
binders, all the FLAG consensus sequences occurred in the C-terminal half of CDR3 and 
with two exceptions all occupied identical positions. To ascertain if this observation was 
related to the presence of CDR1-CDR3 disulfide linkage, the reduced version of the same 
Ubrary was also panned against M2 IgG. This was done by the addition of an appropriate 
concentratioii of DTT to the phage mixture during the binding stage of the panning 
procedure. Panning was performed at 1 M, 1 M, 10 M and 100 M DTT. The same 
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concentration of DTT was also included in the wash buffer to maintain the phage dAbs in the 
reduced state. The CDR3 sequences of the dAbs thus isolated are given in Table 2. As in the 
absence of DTT and at all DTT concentrations, the FLAG consensus sequences were located 
near the C-terminal end of CDR3. 

The binding (binding kinetics) to M-2 IgG of five of the dAbs listed in Table 2 (M2R2-2, 
M2R2-4, M2R2-10, M2R2-13 and M2R3-4 were investigated by surface plasmon 
resonance. It was observed that the binding data fit poorly to a 1:1 interaction model in 
all instances, making the derivation of kinetic and affinity constants impossible. When 
binding studies were conducted in the presence of DTT it was observed that the amount 
of binding increased significantly, particularly for M2R2-2. Furthermore, data 
collected in the presence of DTT fit much better to a 1:1 interaction model. In view of 
this result a M2R2 mutant lacking the CDR1-CDR3 disulphide bridge was constructed 
and expressed for BIACORE studies. The data for the binding of this mutant to 
immobilized M2R2-2 IgG (Fig. 11) fit reasonably well to the simple interaction model. 
Global analysis of the data gave an association rate constant of 340 M"' s*^ and a 
dissociation rate constant of 3.4 x 10*^ s'\ From these rate constants the KD of the 
interaction was determined to be 1.1 x 10"^ M. 

NMR studies of R3A10fCvsl and M2R2-1 

Both R3A10(Cys") and M2R2 were soluble up to mM concentration without precipitation or 
aggregation. Figure 8 shows the ^^-^H HSQC spectra for these two proteins. The HSQC 
cross peaks are well dispersed in both proton and dimensions, indicating the proteins are 
folded in solution. Excluding those from the side chain amides, -120 HSQC cross peaks were 
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observed for both R3A10(Cys") and M2R2, which is less than that expeeted. Most of these 
HSQC peaks (>90%) were assigned by using heteronuclear NMR data. The cross peaks 
corresponding to the amides of residues in the CDR3 were found missing in the HSQC 
spectra, which suggests that the CDR3 in both R3A10(Cys') and M2R2-1 is either not 
structured or have multiple conformations in solution. By using a combination of the 
HNCACB and CBCA(CO)NH spectra, most of the backbone (NH, *^C") and side-chain 
^^C^ resonances were assigned for the residues having HSQC cross peaks. The protein 
secondary structure was analyzed using *^C" chemical shifts for the assigned residues 
(Wishart and Sykes, 1994). Most of ^^C** resonances were down-field shifted when compared 
with the corresponding random coil values, suggesting that the proteins are rich in P-strands. 
This is in agreement with the p-stmctures typically formed for immimoglobulin variable 
domains. 

Minimizing the size of antigen binding proteins to a single immunoglobulin domain has been 
one of the primary goals of antibody engineering over the past decade. However, low levels 
of soluble expression in E. coli and solubility problems have hampered development of such 
molecules. The discovery of camelid heavy chain antibodies (Hamers-Casterman et al, 1993) 
opened up new opportunities for development of single domain antibodies, including the 
incorporation of features of these antibodies into human Vh fi'ameworks. Camelization of 
hvunan Vhs is a promising technology for the generation of small antigen binding firagments 
that should be useful for therapeutic purposes in humans. However, while the camelized 
antibodies described in the literature (Davies and Reichmann, 1994; Davies and Reichmann, 
1995; Reichmann and Davies, J. (Biomolecular NMR) have tremendously improved physical 
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properties relative to their non-camelized counterparts, these properties are still less than 
ideal. 

Davies and Reichmann camelized a human Vh by introducing Gly44Glu, Leu45 Arg 
. and Trp47Gly mutations. However, the yields in E. coli of soluble camelized product were 
low (typically less than lmg/1) and in order to obtain the yields and stability required for 
NMR studies they opted for a Trp47Ile mutation instead of the Trp47Gly mutation (Davies 
and Reichmann, 1995). This resulted in yields of up to S mg/1 which is an order of magnitude 
lower than the yields reported here for camelized BT32/A6. A NMR structure of a himian VH 
camelized in this manner has been described (Reichmann, J. Mol Biol) but in order to reduce 
aggregation and achieve sufficient solubility CHAPS detergent had to be added to the sample 
during NMR data collection. By contrast the camelized BT32/A6 molecules described here 
were completely free of aggregated material in the absence of detergent. Size exclusion 
chromatograms showed single peaks at an elution position expected for monomer Vh and 
high quality NMR data was collected in the absence of detergent. 

A6VH displays a number of features that makes it a desirable template for camelized library 
construction. First, both its expression and its solubility are very high, atypical of VHs which 
are derived from conventional four chain antibodies. Second, the protein is mostly existed in 
a monomeric form (Figure 6B). Third, it had an unusually long CDR3 and therefore 
approximates the VhH situation. 

As a template for camelized Vh library construction, BT32/A6 offers the option for 
introduction of a CDR1-CDR3 disulphide bridge. Formation of the disulphide bond was 
confirmed for several sequences and introduction of the two cysteines did not have a negative 
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impact of the yield of soluble product. For M-2 binders the presence of the disulfide imposed 
a constraint that prevented optimal interaction with M-2. In other instances, however, the 
presence of the bridge would probably be advantageous since this is a common feature of 
heavy chain antibodies. Construction and pooling of two libraries, one with and one without 
the bridge, would appear to be advantageous. 

The observation that the consensus sequence recognized by M-2 (XYKXXD) always 
occurred in the C-terminal half of the CDR3 of the M-2 binders is thought to indicate that 
contact residues reside in this portion of the CDR. On a random basis and considering the 
length of the randomized region of CDR3 the consensus sequence should occur at a 
firequency of 4 x 10"^. Consequently, a library with 2x10^ members should contain 500 
independent anti-M2 dAbs displaying the consensus FLAG sequence on CDR3. The 
preferential use of the C-terminus as an antigen contact region is in sharp contrast to an anti- 
lysozyme dAb where all the antigen contacting residues of CDR3 are located at its N- 
terminal half (Desmyter, Transue, et al., 1996), 

It is not surprising that monovalent display using a phagemid vector failed to yield 
binders. Davies and Riechmann (1995) also constructed a camelized dAb library by 
randomizing CDR3 amino acid residues but the library was ten times larger and yielded anti- 
hapten dAbs with dissociation constants in the range of 100-400 nM. However, the isolated 
anti-protein dAbs had weak affinity (Davies & Riechmann, 1995) (Davies & Riechmann, 
1996). Therefore, a smaller library such as the one constructed here may therefore contain 
only weak anti-protein dAbs. The isolation of such dAbs would be difficult with monovalent 
display (Lowman, Bass, et al., 1991). In a phage vector format the dAb are displayed 3-5 
copies and therefore there is potential for avidity which increases the likelihood of isolating 
weak binders (Nissim, Hoogenboom, et al., 1994). 
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In Other preferred embodiments of the invention, the randomized positions in A6 and 
A6.1 libraries are preferably at positions lOOi to lOOn as indicated by the data demonstrating 
binding in the C-terminal regionof the CDR3 loop. 

In addition to CDR3 residues, CDRl positions could be identified for limited 
randomization. Libraries containing shorter and partially randomized CDR3 could be 
constructed and pooled to further increase diversity. 

Figure 15 is a schematic representation of the steps taken to remove the 
recombination site at the 5' end of the A6VH gene. Using the plasmid pSJF-A6VH as 
template and 1 & 3 and 2 & 4 primer pairs, two overlapping fragments were constructed by 
PGR. From these, a larger construct (Fgmtl) was assembled by splice overlap extension 
(SOE) and further amplified by PGR using primers 1 and 2. For simplicity only the part of 
the plasmid spanning from RP (1) primer binding site to FP (2) primer binding site and 
containing the A6VH gene is shown. 3=Ghi.F primer; 4=Ghi.R primer (example 21). 

Additional embodiment of the randomization strategy for the libraries of the invention, are 
described below. 

The present inventors have also found a method of enhancing the probability that the binding 
firagments displayed in the library have characteristics which approximate the desired 
solubility characteristics found in the wild type binding fragment. During construction of the 
library, nucleotides of the variable region are added in a step-wise addition and by selecting a 
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nucleotide ratio which is biased in favor of producing amino acids which reflect the DNA of 
the parental or wild type species. 

Tiius, a method for biasing a library in favor of obtaining selected percentages of wild type 
amino acid residues is achieved by creating residue substitutions by using different spiking 
levels of the various dNTPs as described below. When creating a phage library, the 
randomization of amino acids is often achieved by DNA synthesis. A primer is annealed next 
r A encoding for the variable region, and nucleotides are randomly added to synthesize 
. , . mizsd variable regions. Normally, at the step of synthesizing the DNA used to produce 
the variable region of the phage library, one uses a nucleotide ratio of 1 : 1 : 1 : 1 , which 
generates a totally random variable region. By the present method, during synthesis of the 
variable region, the likelihood of achieving affinity or other desirable traits found in the wild 
type as follows. At each step of adding a nucleotide to the DNA variable region, one selects 
a dNTP ratio which is biased in favor of producing amino acids which reflect the DNA of the 
parental (wild type) species. 

Table 3 charts particular amino acid residues or sequences of residues and preferred types of 
amino acid substitutions according to various examples of the invention to be defined 
hereafter. The selection of amino acids for randomization or partial randomization is based 
on adopting one or more of a variety of approaches including one of more of the following: 

1. universal recognition of wild-type amino acids through a broad-based biasing in 

favour of the wild-type amino acids in one or more regions of interest (approximately 
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10%-90% biasing) in order to maintain the characteristics of the parental Vh ligand- 
bindingmolecule; 

2. selective recognition of amino acids that are important to maintain as wild-type 
through biasing (approximately 10-100%) in order maintain conserved or strategic 
regions of amino acid residues of the parental Vh ligand-binding molecule; and 

3. recognition of selected amino acids as important for intermolecular interaction and 
biasing those amino acids to wild-type and amino acids of the same type. 



Tables 



Amino Acid Residue #s 


Description of Various Preferred Amino Acid Constitutions 


a. At least one of 100a- 
lOOh, preferably at each 
position of lOOa-lOOh 


Randomize; 

At least approximately 10% biasing in favor of wild-type amino 
acids; 

At least approximately 50% biasing in favor of wild-type amino 
acids; 

At least approximately 90% biasing in favor of wild-type amino 
acids; 

Randomize, but bias lOOf to wild-type (approximately 10-100%) 
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b. At least one amino 
acid of: lOOa-lOOb and 
lOOg-lOOh preferably at 
each position of 100a- 
100b and lOOg-lOOh 


Randomize; 

Randomize with bias to wild-type (approximately 10-100%), 
preferably at least approximately 50% wild-type, alternatively at least 
approximately 90% wild-type amino acids;30 Randomize with bias 
to one of the amino acids selected from the group consisting of 
tyrosine, histidine, glutamine, asparagine, lysine, aspartic acid and 
glutamic acid (approximately 10-100%) 


c. At least one of 100b- 
lOOg, preferably at each 
position of lOOb-lOOg 


Randomize; 
Delete; 


d. lOOa-lOOh 


Random additions of up to 10 amino acids; 
Random deletions of up to 7 amino acids; 


e. 95-1000 


Randomize; 

Random additions of up to 10 amino acids; 
Random deletions of up to 7 amino acids; 


f At least one of 95- 
100, preferably at each 
position of 95-100 


Randomize; 

Randomize with bias to wild-type (approximately 10-100%), 
preferably at least approximately 50% wild-type, or preferably at 
least approximately 90% wild-type amino acids; 
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Invariant (primer spans this region) 


g. 101-102 

conserved amino acids 


Invariant (primer spans this region) 
N/A 


h. lOOI-lOOo 


Randomize 

Randomize with bias to wild-type (approximately 10-100%), 
preferably at least approximately 50% wild-type, more preferably at 
least approximately 90% wild-type amino acids; 
Randomize with bias to one of the amino acids selected from the 
group consisting of tyrosine, histidine, glutamine, asparagine, lysine, 
aspartic acid and glutamic acid (approximately 10-100%); 
Randomize with bias to maintaining lOOo as wild-type (10-100%). 


i. At least one amino 
acidof lOOa-lOOb, 
lOOg-lOOh and 1001- 
lOOo, preferably at each 
position of lOOa-lOOb, 
lOOg-lOOh and 1001- 
lOOo 


Randomize with bias to wild-type (approximately 10-100%), 

preferably at least approximately 50% wild-type, more preferably at 

least approximately 90% wild-type amino acids; 

Randomize with bias to one of the amino acids selected from the 

group consisting of tyrosine, histidine, glutamine, asparagine, lysine, 

aspartic acid and glutamic acid (approximately 10-100%); 

Bias to aromatic ammo acids (10-100%) 


j. 95-lOOh 


Randomize but maintain any 5-10 consecutive amino acids as wild- 
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type 


k. lOOa-lOOo 


Randomize but maintain any S-10 consecutive amino acids as wild- 
type 



Unless otherwise necessarily implied as a result of logistical considerations, it is to be 
understood that the various embodiments which relate to choice of amino acids for random, 
biased or fixed substitution (specified in column 1) as well as the various embodiments 
related to types of substitutions (column 2) are not mutually exclusive. Moreover the various 
permutations and combinations of such substitutions are hereby contemplated as 
embodiments of the invention. For example, substitutions referred to in row a. (any one or 
more amino acids and preferably all amino acids of residues 100a - lOOh) #3 (at least 
approximately 50% wild-type anuno acids) may combined with row b. (any one or more and 
preferably all of amino acids residues lOOa, lOOb, lOOg and lOOh) #2 (for instance, at least 
approximately 90% wild-type amino acids) so that, for instance, any 3 of the amino acids in 
lOOa - lOOh are biased in favor of wild-type in approximately 50% of the variant Vh ligand- 
binding fragments and 100a and 100b are biased in favor of wild-type in 90% of potential 
binding fi^gments. 

By necessary implication the three amino acids that are biased in favor of wild-type are not 
residues 100a and 100b, but they may be any other three residues. Accordingly, the broadest 
possible interpretation is to be given to the disclosure of the various combinations and 
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permutations of the embodiments disclosed herein. Furthermore, it is to be understood that 
each of the various embodiments described herein are disclosed, except insofar as logistically 
impossible, in reference to each of the various aspects and definitions of the invention. 
Moreover, it is to be imderstood that phrases such as at least approximately 10%, or 
approximately 10-100% are intended to specify a preference for each of the irnit percentages 
betwem about 7 and 100% that are practically achievable by oligonucleotide primer design 
and PGR amplification described herein below, as well as other well known PGR techniques 
and techniques of Controlled Mutation described in the art, and routine variations of such 
techniques. By the same token, phrases such as at least 80% are intended to specify a 
preference for each of the unit percentages between 80% and 1 00%. It is to be understood 
that biasing of a percentage less than 100% implies unless otherwise implied or stated that the 
remaining percentage is fully randomized. Furthermore, it is to be understood, for example, 
that 90% biasing in favor of wild-type amino acids at a given amino acid position is to be 
approximated by controlling the percentage amounts of each of the three relevant nucleotides 
(so that, for example, the product of the probabilities of occurrence of the three desired 
nucleotides in sequence in the growing chain is 90%) so as to supply 90% of correct coding 
triplet(s) and a total of 10% of random coding triplets, having regard to the degeneracy of the 
genetic code (for example if two different coding triplets result in a given amino acid, then 
the sum of the probabilities of achieving those two triplets will have to equal 90%). This is 
preferably accomplished on an amino acid by amino acid basis so that, for example the 
probability of achieving two and three wild-type amino acids in sequence, in the case of 90% 
biasing is 0.81 and 0.73, respectively, etc. It is to be understood that this high level of biasing 
may be suitable only for part of the coding sequence into which variability is introduced and 
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that higher levels of biasing are acceptable, when for example substantially all of the amino 
acids of a long CDR3 are biased, as disclosed in one of embodiments herein. 

Accordingly there is a balance to be struck between a large diverse library and biasing for 
multifactorial characteristics such as solubility. Nevertheless it is contemplated that the 
library produced may be a pooled library in which several libraries each having varying 
degrees of biasing to wild-type, for example, 60%, 50%, 40% and 30%, are pooled together 
to obtain the both desired variability and similarity. The preferred parental binding-fragment 
may be engineered to maximize the desired characteristic (e.g. solubility, intermolecular 
interaction) and then made the subject of libraries with varying degrees of biasing. In this 
connection, the library could be biased to be rich in amino acids, which are highly soluble. It 
is to be understood that both arms (halves) of the preferred longer loop forming CDR3s may 
be biased to amino acids that are favored for intemiolecular interaction, preferably charged 
amino acids, so as to provide a method of generating, in addition to loop size, varying loop 
structures. This bias may be systematically introduced or systematically reduced by 
randomization, in cooperating pooled libraries having varying degrees of biasing. 

With respect to the application of these methods to parental Vh, preferably, CDR3s of a 
variety of different lengths from 16 to 33 amino acids are predominantly represented among 
the variant Vh Hgand-binding fragments. Preferably CDR3s of a variety of different lengths, 
from 1 8 to 25 amino acids, or, from 1 8 to 23 amino acids are predominantly represented in 
the library. Although the term "predominant" ordinarily implies a majority representation of 
the specified long CDR3 variant Vh Hgand-binding fragments, the invention also 
contemplates an even less substantial representation, especially within a reasonably large size 
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library (>10^). Preferably, the specified long CDR3 variant Vh ligand-binding fragments 
have a majority representation within the library and more preferably an even greater or 
ex:;lusive representation. 

- optionally, the parental Vh ligand-bindingmolecule is reduced in size and the parental Vh 
ligand-binding molecule is optionally modified by deleting a portion of the CDR2. In 
another embodiment, CDR3s of the same length as that of the parental Vh ligand-binding 
raoleculeare predominantly or exclusively represented in the variant Vh ligand-binding 
fragments. 

In another aspect, the CDR3 region is specifically retained along with human sequence 
elements of other regions that confer favorable characteristics solubility, to create a phage 
display library having favorable characteristics of solubility, preferably when compared with 
variant Vh ligand-binding fragments that have fiiUy randomized hypervariable regions 
(particularly CDR3). In particular, the present inventors have found that favorable solubiUty 
characteristics of a parental Vh ligand-binding molecule can be maintained in the population 
of variant Vh Ugand-binding fragments in the course of randomizing the hypervariable 
regions by biasing all or selected amino acids residues to wild-type and/or biasing in favor of 
amino acids residues that favor certain or a variety of types of intermolecular interaction. 
This is respectively accomplished by increasing the percentage amounts of nucleotide bases 
that represent wild-type amino acids and/or amino acids that provide favorable intermolecular 
interactions during the randomization procedure e.g. site directed PCR mutagenesis. 
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Thus, variant Vh ligand-binding fragments having relatively long CDR3s of varying lengths 
are produced by randomly or partially randomly inserting varying numbers of nucleotide 
triplets in any part of a randomized portion of the parental Vh framework. Primers of the 
desired length and nucleotide composition are synthesized followed by PCR amplification. 
Desired randomization can be achieved by biasing nucleotide composition of the primer. The 
production of displays of long CDR3 variant binders may also be accomplished by pooling 
several libraries of variantVH ligand-binding fragments having randomized or partially 
randomized CDR3s of different respective uniform lengths. These strategies are not mutually 
exclusive. 

The additional following terms are used herein as follows, imless the context logically 
implies otherwise: 

"Biasing", *T>iased in favor of and related forms of these terms are generally intended to 
refer to weighting in the. course of introducing variation in the parental ligand-binding 
molecule. 

**Homologous" or '"homology" as used herein refers to "identity" or "similarity" as used in 
the art, meaning relationships between two or more polynucleotide or amino acid sequences, 
as determined by comparing the sequences. In the art, identity also means the degree of 
sequence relatedness between polynucleotide sequences, as the case may be, as determined 
by the match between strings of such sequences. Both identity and similarity can be readily 
calculated (Lesk, A. M., ed.. Computational Molecular Biology^ Oxford University Press, 
New York, 1988; Smith, D. W., ed., Biocomputing: Informatics and Genome Projects ^ 
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Academic Press, New York, 1993; Griffin, A. M., and Griffin, H. G., eds.. Computer 
Analysis of Sequence Data, Partly Humana Press, New Jersey, 1994; von Heinje, G., 
Sequence Analysis in Molecular Biology, Academic Press, 1987; and Gribskov, M. and 
Devereux, J., eds.. Sequence Analysis Primer, M Stockton Press, New York, 1991). While 
there exist a number of methods to measure identity and similarity between two 
polynucleotide sequences, both terms are well known to skilled artisans (von Heinje, G., 
1987;Gribskov, M. and Devereux, J., 1991; and Carillo, H., and Lipman, D., 1988). Methods 
commonly employed to determine identity or similarity between sequences include, but are 
not limited to those disclosed in Carillo, H., and Lipman, D. (1988, SLAM J, Applied Math., 
48: 1073). Methods to determine identity and similarity are codified in computer programs. 
Preferred computer program methods to determine identity and similarity between two 
sequences include, but are not limited to, GCG program package (Devereux, J., et al.(1984). 
Nucleic Acids Research 12(1): 387), BLASTP, BLASTN, and FASTA (Atschul, S. F. et 
al.(1990), J. Molec. Biol, 215: 403). "Percent homology" or "% homologous" or related 
terms include both of the following interpretations / methods of calculation: 1) an 
approximate percentage of the sequence referenced in terms of the number of common 
residues (e.g. 80% of 1 1 is imderstood to be an approximation insofar as application of the 
percentage does not yield a unit number of residues, in which case both the immediately 
higher number and immediately lower unit numbers, 9 and 8 respectively, are deemed to be 
covered); 2) the percentage of binding fragments theoretically achievable that have the fiiU 
wild-type sequence, which is calculated as a product of the probabilities that the wild-type 
amino acid will occur at a given amino acid position. 
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"Conserved" regions refer to those which are commonly found in at least other antibodies of 
the same type or in at least the same species of mammal. 

"Wild-type" refers to the parental binding-fragment, which may be a variant of the natural or 
to the native A6 Vh parental ligand-binding fragment, depending on the context. 

"Step-wise" refers to the addition of, for example, nucleic acids, in a manner such that the 
quantity of nucleic acids added at each step is rigorously control, usually one nucleic acid at a 
time. 

"Spanning" does not preclude deletions or additions within the parental Vh binding-fragment 
that are not inimical to the operation of the invention. 

"Camelid type" refers specifically to one or more features of the camelid Vl interface, 

"Soluble" includes the generally ascribed meaning in the art and without limitation includes 
(based on solubility correlated phenomena) the relative amounts of naturally-folded 
recombinant protein released from the cell. 

"Percent biasing" or "% of binding fragments" (or "biasing 10-100%", etc.) refers to biasing 
on an individual amino acid basis (though other techniques to accomplish the same effect 
might apparent to those skilled in the art). Similarly, the specification that wild-type amino 
acids occur at a specified position or series of positions in, for example, at least 
approximately 50% of potential binding fragments is intended to mean both that 50% biasing 

42 



SUBSTITUTE SHEET (RULE 26) 



wo 01/18058 



PCT/CAOO/01027 



is sought at a given such position or that a total of 50% of the correct nucleotide triplets are 
represented. 

"Approximately" in reference to percentages is intended to accommodate attrition of various 
desired variant Vh ligand-binding fragments, the assumption that the probabiUstic outcomes 
will not be achieved in practice and that certain variation in methods to accomplish the 
specified results is deemed to be suitable. The term 50% in reference to an uneven number of 
amino acids residues means that either one more or one less than half of the amino acids is 
referred to. 

The practice of the present invention employs, unless otherwise indicated, conventional 
techniques of molecular biology (including recombinant techniques), microbiology, cell 
biology, biochemistry and immunology, which are within the skill of the art. Such 
techniques are explained fiilly in the literature, such as, "Molecular Cloning: A Laboratory 
Manual", second edition (Sambrook et al., 1989); "Oligonucleotide Synthesis" (M.J. Gait, 
ed„ 1984); "Animal Cell Culture (R.I. Freshney. ed., 1987); "Methods in Enzymology" 
(Academic Press, Inc.); "Handbook of Experimental Immunology" (D.M. Wei & C.C. 
Blackwell, eds.); "Gene Transfer Vectors for Manmiahan Cells" (J.M. Miller & M.P. Calos, 
eds., 1987); "Current Protocols in Molecular Biology (P.M. Ausubel et al., eds., 1987); "PCR: 
The Polymerase Chain Reaction", (Mullis et al., eds,, 1994); "Current Protocols in 
Inmiunology" (J.E. Coligan et al., eds., 1991). These references are incorporated herein by 
reference. These techniques are applicable to the production of the polynucleotides and 
polypeptides of the invention, and, as such, may be considered in making and practicing the 
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invention. Particularly useful techniques for particular embodiments will be discussed in the 
sections that follow. 

Recombinant genetic techniques have allowed cloning and expression of antibodies, 
functional fiagments thereof and the antigens recognized. These engineered antibodies 
provide novel methods of production and treatment modalities. For instance, functional 
immunoglobulin fragments have been expressed in bacteria and transgenic tobacco seeds and 
plants. Skerra (1993) Curr.Opin, ImmunoL 5:256:262; Fiedler and Conrad (1995) 
Bio/Technology 13:1090-1093; Zhang et al. (1993) Cancer Res. 55:3384-3591; Ma et al. 
(1995) Science 268:916; and, for a review of synthetic antibodies, see Barbas (1995) Nature 
Med, 1 :836-839. These and more current references describing these techniques, which these 
references, particularly those well known to persons practicing in the relevant arts, are hereby 
incorporated herein by reference. 

Suitable parental binding-fragments include any known in the art and include the group 
consisting of an scFv, Fab, Vh, Fd, Fabc, F(ab')2, F(ab)2 derived from A6. 

Nucleotide sequences can be isolated, amplified, and processed by standard recombinant 
techniques. Standard technique in the art include digestion with restriction nucleases, and 
amplification by polymerase chain reaction (PGR), or a suitable combination thereof. PGR 
technology is described m U.S. Patent Nos. 4,683,195; 4,800,159; 4,754,065; and 4,683,202, 
as well as PCR: The Polymerase Chain Reaction, MuUis et al., eds., Birkauswer Press, 
Boston (1994). 
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In addition to the specific PCR methods of biasing to wild-type A6 amino acid residues 
detailed below, it is possible to produce multiple different oligonucleotide primers consisting 
of specified amino acid residues (one or more) of the wild-type A6 molecule (e.g. CDR3 
residues), mixing these in appropriate concentrations with a completely randomized (e.g. 
CDR3) oligonucleotide primer and subjecting the mixture of oligonucleotide primers to PCR. 
This will result in a biased phage library population of one's choosing (i.e. the amounts of the 
selectively randomized and totally randomized primers in the mixture will determine the per 
cent of each CDR3 representation in the library). 

Polynucleotides comprising a desired sequence can be inserted into a suitable vector, and the 
vector in turn can be introduced into a suitable host cell for replication and amplification. 
Polynucleotides can be introduced into host cells by any means known in the art. Cells are 
transformed by introducing an exogenous polynucleotide by direct uptake, endocytosis, 
transfection, f-mating or electroporation. Once introduced, the exogenous polynucleotide can 
be maintained within the cell as a non-integrated vector (such as a plasmid) or integrated into 
the host cell by standard methods. See, e.g., Sambrook et al. (1989). RNA can also be 
obtained fi-om transformed host cell, or it can be obtained directly from the DNA by using a 
DNA-dependent RNA polymerase. 

Suitable cloning and expression vectors include any known in the art, e.g., those for use in 
bacterial, manunalian, yeast and insect expression systems. Specific vectors and suitable host 
cells are known in the art and are not described in detail herein. See e.g. Gacesa and Ramji, 
Vectors, John Wiley & Sons (1994). 
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Phage display techniques are generally described or referenced in some of the preceding 
gv-aeral references, as well as in U.S. Patent Nos. 4,593,002; 5,403,484; 5,837,500; 
5 571,698; 5,750,373; 5,821,047; 5,223,409 and 5,702,892. "Phage Display of Peptides and 
:*^roteins", (Kay, Brian K. et al., 1996); "Methods in Enzymology", Vol. 267 (Abelson, John 
N., 1996); "Immunology Methods Manual", (Lefkovits, Ivan, 1997); "Antibody phage 
display technology and its applications", (Hoogenboom, Hennie R. et al., 1998). 
Immunotechnology 4 p. 1-20; Cesareni G et al. Phage displayed peptide libraries. Comb 
Chem High Throughput Screen, 1999 Feb;2(l):l-17; Yip, YL et al. Epitope discovery using 
monoclonal antibodies and phage peptide libraries. Comb Chem High Throughput Screen. 
1999 Jun;2(3): 125-38; Rodi DJ et al. Phage-display technology—finding a needle in a vast 
molecular haystack. Curr Opin BiotechnoL 1999 Feb;10(l):87-93. 

Generally, DNA encoding millions of variants of a parental binding-fragment can be batch- 
cloned into the phage genome as a fusion to the gene encoding one of the phage coat proteins 
(pin, pVI or pVin). Upon expression, the coat protein fusion will be incorporated into new 
phage particles that are assembled in the bacteriiun. Expression of the fusion product and its 
subsequent incorporation into the mature phage coat results in the ligand being presented on 
the phage surface, while its genetic material resides within the phage particle. This 
connection between ligand genotype and phenotype allows the enrichment of specific phage, 
e.g. using selection on immobilized target. Phage that display a relevant ligand will be 
retained, while non-adherent phage will be washed away. Bound phage can be recovered 
from the surface, reinfected into bacteria and re-grown for further enrichment, and eventually 
for analysis of binding. The success of ligand phage display hinges on the combination of 
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this display and enrichment method, with the synthesis of large combinatorial repertoires on 
phage. 

While the use of phage is described as an embodiment for the production of libraries for 
displaying, and selecting particular binding fragments, it is to be understood that and suitable 
genetic package may be used for the production of libraries of the invention. Such suitable 
genetic packages include cells, spores and viruses (see US Patent No. 5,571,698), or any 
other suitable replicable genetic packages. With respect to cell based approaches, another 
popular method of presenting a library is the two-hybrid system (Feilds and Stemglanz, 1994, 
Trends in Genetics 10:286-292). Those skilled in the art will appreciate that in vitro systems 
(non-cell based) may be equally applicable to the methods of the present invention, for 
example ribosome display (Hanes et al., 1998) or RNA-peptide fusion (Mattheakis et al., 
1994, Proc Natl Acad Sci USA 91 :9022-26; Hanes et al., 1999, Curr Top Microbiol Immunol 
243:107-22). 

Ribosome display is a well documented technique that may be useful for generating libraries. 
This entirely in vitro method allows for libraries with a diversity of > 10^^. In this method, a 
peptide is displayed on the surface of a ribosome that is translating it. Briefly, a library of 
niRNA molecules (we could start with A6) is translated in vitro translation system to the 3* 
end, such that the ribosome does not fall off. The protein emerges from the ribosome in such 
a way that it can fold, but does not fall off. In some instances, there is an additional folding 
step in an oxiding environment (important for proteins with disulfide bonds). The whole 
complex of folded protein, ribosome and mRNA, which is stable for several days, can then be 
panned against a ligand that is recognized by the translated protein. (For example, the 
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translated protein could be an antibody and the ligand is its antigen). The mRNA can then be 
amplified by reverse transcription and PGR. This technique has been used to successfully 
generate scFv antibody fragments with high affinity for their target. Reference is made to 
HanesJ., Jennutus,L., Weber-Bomhauser,S-» Bosshard,H.R. & Pluckthun,A. Ribosome 
display efficiently selects and evolves high-affinity antibodies in vitro from immune libraries. 
Proc. Natl Acad. ScL USA 95, 14130-14135 (1998); Schaffitzel,C,, HanesJ., Jermutus,L. & 
Pluckthun,A. Ribosome display: an in vitro method for selection and evolution of antibodies 
from libraries. Journal of Immunological Methods 231, 1 19-135 (1999); He>l. et al. 
Selection of a human anti-progesterone antibody fragment fix>m a transgenic mouse library by 
ARM ribosome display. Journal of Immunological Methods 231, 105-1 17 (1999); 
RobertsJR. W. Totally in vitro protein selection using mRNA-protein fusions and ribosome 
display. Current Opinion in Chemical Biology 3, 268-273 (1999); Williams,C. Biotechnology 
match making: screening orphan ligands and receptors. Current Opinion in Biotechnology 11, 
42-46 (2000); Mattheakis,L.C., Bhatt,R.R. & Dower, W.J. An in vitro polysome display 
system for identifying ligands from very large peptide libraries. Proc. Natl. Acad. ScL USA 
91,9022-9026(1994). 

Example 1 - Constniction of Single-domain A6-based (A6-based dAb^ DNA Templates 

To facilitate construction of the A6-based dAb libraries, a Nhel site was introduced at the 
amino acid residues 24-25 (nucleotides underlined and bolded in Figure 14) while 
maintaining the wild-type amino acid sequence. Briefly, the A6 Vh gene was used as a PCR 
template to amplify a shorter internal fragment employing the primers A6VH/NheI- 
5'(TGTTCAGCTAGCGGATTC)3' and A6VH/BstEn- 
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5'(TGAGGAGACGGTGACCGTTGTCCCTTGGCCCCAGATATCAAA)3\ These primers 
incorporate Nhel and BstEQ sites (underlined) at the and 3' ends of the amplified product. 
PCR (polymerase chain reaction) was performed in a total volume of 50 :1 containing 200 
mM each of the four dNTPs, 100 pmol each of the two primers, 5 1 of lOX buffer (New 
England Biolabs (NEB)), and 2 units of Vent DNA polymerase (NEB). 

The amplified product was purified using QIAquick PCR Purification kit™ (QIAGEN, 
Mississauga, ON), digested with Nhel and BstEH restriction endonucleases and subsequently 
ligated to the NheVBstEil-r^sincX^ pSJFl-10A12 vector derived fi-om pUC 8 (Narang et al., 
1987) to replace a portion of the existing A6 Vh gene. To constract the pSJFl vector, the 
pUC 8 plasmid (Vierra and Messing, 1982; Messing, 1983) was modified by inserting the 
OmpA signal sequence and the Hiss-carboxy tail between the EcoRI and Hindlll restriction 
sites of the pUC 8 polylinker region, using oligonucleotide primers and PCR (Narang et al., 
1987). 

Electro-competent E.coli TGI cells were prepared (Tung and Chow, 1995) and an aliquot of 
the ligated product was used to transform the cells. Transformation was carried out using the 
BIO-RAD Gene Pulser™ (Bio-Rad Laboratories, Mississauga), ON according to the 
manufacturer's instructions and the clone harbouring the mutated A6 dAb gene was 
confirmed by sequencing (Sanger, F. et al.. 1977) using the AmpliTaq DNA Polymerase FS 
kit and 373 A DNA Sequencer Stretch (PE Applied Biosystems, Mississauga, ON). All the 
cloning steps were perforaied as previously described (Sambrook et al. 1989). The resulting 
vector is termed pSJFl-A6VH.Mid. 
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Example 2 - A6 dAb Library Construction 

The steps of A6 dAb library construction involved a series of sequential PCR experiments. 

(1) Introduction of restriction sites to facilitate cloning: To amplify the target DNA, the 
PCR mixture was first incubated at 95'^C for 5 min, then subjected to 30 cycles of: 30 sec 
at 94°C, 1 min at 40''C and, I min at IT'C. The A6VH.Mrrf-containing plasmid, pSJFl- 
A6VH.A/ZieI, was used as the template in PCR to ampUfy a shorter fi-agment using the 
primers ASVWApaa - 5' (CATGACCACAGIQQACAGGAGGTCCAGC- 
TGCAGGAGTC) 3' and A6VH.FR3.F - 5' (TTTCACACAGTAATACAC) 3'. The 
PCR mixture contained 200 M each of the four dNTPs, 0.2 pmol/ 1 each of the two 
primers, IX buffer (Perkin Elmer), and 0.05 units/ 1 of AmpliTaq DNA polymerase 
(Perkin Elmer). (The fomier primer also introduces Apall site at the 5' end of the PCR 
product.) 



Example 3 - Randomization of the A6dAb CDR3 residues: 

The ampUfied firagment firom step (1) was purified by QIAquick Gel Extraction kit™ 
(QIAGEN) and subsequently used as the template in a second PCR reaction using the 
primers ASVliApall and A6VH.RndmCDR3.F - 5* (GCCCCAGATATCAAA20 
[((A/C)NN)]TTTCACACAGTAATA)3'. At the protein level the second primer results in 
the randomization of the first 20 residues in CDR3. The PCR mixture was identical to above 
except that the concentration of the primers was increased to 0.5 pmol/l to ensure that 
sufficient amounts of oligonucleotide primers and dNTPs were provided for the generation 
of a large randomized library. 
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Example 4 - Addition of a Notl restriction site, ligation to the phage vector and 
library construction. 

The amplified fragments were purified as above and used as templates in a third round of 
PGR employing 2 pmol/ul each of the two primers A6VtUApall (described above) and 
A6VH.NotI.EXT.F - 5YCGATTC TGCGGCCGCT GAGGAGACGGTGACCGTT- 

GTCCCTTGGCCCCAGATATCAAA) 3\ (The latter primer incorporates the Notl site 
(underlined) at the 3' end of the amplified products.) The amplified Augments were purified 
using QIAquick PGR Purification kit™ (QIAGEN), digested with Apall and Notl, and 
ligated to ApalUNotl-digested fd-tet phage vector (McAflFerty et al., 1990; Zacher et al., 
1980). The ligated product was desalted using QIAquick PGR Purification kit™ (QIAGEN). 
To determine the size of the library, immediately following the transformation and after the 
addition of the SOG medium (per L: bacto-tryptone, 20 g; bacto-yeast extract, 5 g; NaGl, 0.5 
g; glucose, 3.6 g) a small aliquot of the electroporated cells were serially diluted in 
exponentially growing E. coli strain TGI cells. Two hundred \i\ of the diluted cells were 
mixed with 3ml of 50°C top agar and immediately poured onto 2xYT (per L: bacto-tryptone, 
16 g; bacto-yeast extract, 10 g; NaGl, 5 g) plates pre-warmed to 37**G. Plates were incubated 
overnight at 37°C and the number of plaques were used to determine the size of the library. 
Following this, the DNA inserts from single plaques were ampUfied using PGR. The size of 
the amplified product, determined by agarose gel electrophoresis, was used to determine the 
fraction of the library with fiiU-sized A6 dAb inserts. Diversity of the library was 
determined to be in the range of lO'-lO*. 
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The recombinant phage vectors, 1.5 ng, were mixed with 40 jil of competent E. coli strain 
TGI and the cells were transformed by electroporation. Following transformation, 1 ml of 
SOC medium was added to each electroporation mixture (45 ml in total). The mixture was 
divided into three equal aliquots, each of which were added to tubes containing 3 ml of top 
agar at 50**C, vortexed immediately, poured onto pre-warmed 2xYT agar plates, and 
incubated at 37°C overnight. Five ml of sterile PBS (per L: NaCl, 8 g; KCl, 0.2 g; Na2HP04, 
1 ,44 g; KH2PO4, 0.24 g; pH 7.4) was added to the plates and the phage particles were eluted 
by gently shaking the plates at 4*'C for 3 hr. The phage-containing PBS supematant was 
collected, the plates rinsed with an additional 5 ml of PBS and the two supematants were 
pooled. The supematants were centrifuged at 6000g for 15 min at 4**C, the cleared 
supematant decanted and the phage were purified as described by Harrison et al. (1996). 
The phage pellet was dissolved in 20 ml of sterile PBS, divided into 100 jil aliquots and 
stored in liquid nitrogen. 



Kitample 5 - Partial Constructio n of A6.1 analogue 



A6 dAb was constructed from the heavy chain variable domain (VH) of the A6, an anti-tumor 
IgM with unidentified antigen (Dan et al.. 1995). Briefly, the dAb gene was amplified by 
polymerase chain reaction (PGR) using the primers: 
HI 1MB, S'fTA TGGATCCT GAGGAGACGGTGACCGT)3'; and 

A6VH. , 5'(TATGAAQACACCAGGCCGAGGTCCAGCTGCAGGAG)3' which contain 
the Bamm and Bbsl sites (underlined). PGR was performed in a total volume of 50 1 
containing 200 M each of the four dNTPs, 100 pmol each of the two primers, 5 1 1 OX 
buffer (NEB), and 2 units of Vent DNA polymerase (NEB). The amplified product was 
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purified using QIAquick PGR Purification kit™ (QIAGEN), digested with BamHl and Bbsl 
restriction endonucleases, and subsequently ligated to the expression vector pSJF2 (Simon J. 
Foote, personal communication). Transformation was performed as described in Example 1. 
Tc modify A6 dAb, the vector containing A6 dAb gene (pSXF2-A6dAb) was used as template 

amplify two overiapping 5' and 3' Augments. The 5' fi-agment was amplified using the RP 
:>rimer 5'(GCGGATAACAATTTCACACAGGAA)3' and A6.1dAb analogue .bk primer 
5'(AGCCTGGCGGACCCAGTGCATAGCATAGCTACTGAAGGTGAATCCGCTAGCTG 

^A.C!\GGAGAGTCT)3\ The 3* firagment was amplified using the FP primer 
iXCCAGGGTTTTCCCAGTCACGAC)3' and the mutagenic primer A6.1 dAb analogue fw, 
5'(TGGGTCCGCCAGGCTCCAGGGAAGGAACGTGAAGGTGTTTCAGCTATTAGT)3'. 
(At the protein level the b old codons in the mutagenic primer introduce Glu, Arg, and Gly 
at positions 44, 45, and 47, respectively). The two fi-agments were gel purified using the 
QIAquick Gel Extraction kit™ (QIAGEN), and a larger constmct was assembled fi-om the 5' 
and 3' fragments by performing splice overlap extension (Clackson, et al., 1991). Briefly, the 
reaction vial containing both 5' and 3' fragments, 200 M each of the four dNTPs, 5 1 1 OX 
buffer (NEB), and 2 units of Vent DNA polymerase (NEB) was subjected to 7 cycles of 1 
min at 94°C and 2.5 min at 72 To amplify the assembled constmct, RP and FP primers 
were added to a final concentration of 10 pmol/ 1 and the mixture was subjected to 25 cycles 
of 1 min at 94 ° C, 1 min at 55 ° C, and 1 min at 72 C. The amplified product was gel 
purified, digested with EcoRl and HindlU^ purified again, and ligated to the EcoRVHindSi 
restricted pSJF2-A6dAb. An aliquot of the ligated product was used to transform the TGI 
cells and the clone harboring the A6.1 dAb was identified by sequencing. As expected, at the 
protein level the A6.1 dAb had acquired the following three mutations: Gly44Glu, Leu45Arg, 
and Tyr47Gly (Davies and Riechmann, 1994) 
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Example 6 - C nstruction of the A6.1 dAb library using phagemid vector. 

As the first step, the camelized dAb gene was used as the template in PCR to amplify a 
shorter fragment. PCR was performed as described above using the two mutagenic primers 
A6VH.33C, 

5YTGTTC AGCTAGCG GATTCACCTTCAGTAGCTATTGTATGCACTGGGTCCGC)3' 
containing the Nhel site (underlined) and A6VH. A, 

5'(TGCTGCACAGTAATACACAGCCGT)3*. (At the protein level the bold codons in 
the mutagenic primers introduce cysteine and two alanine residues at positions 33, 93, and 
94, respectively). The mutated fragment was used as the template in a second PCR using the 
primers A6VH.33C and A6VH.100eC, 

5'(GCCCCAGATATCAAA[(A/C)NN]9GCA[(A/C)NlSnioTGCTGCACAGTA^ The 
second primer results in the randomization of 19 residues in CDR3 and introduces a cysteine 
at position lOOe. The amplified fragments were used as the templates in a third round of PCR 
employing the primers A6VH33C and A6VH.BstEn, 

5XTGAGGAGACGGTGACCGTTGTCCCTTGGCCCCAGATATCAAA)3*. (The latter 
primer incorporates the BstEH site (underlined) at the 3' end of the amplified products.) PCR 
was performed as above using 0.5 pmol of template and 100 pmol of each of the two primers. 
The amplified fragments were purified, digested with Nhel and BstEQy and ligated to 
NheVBstEH-txezXed pSJF6-A6dAb phagemid (Simon J. Foote, personal comunication). The 
ligated product was desalted using QIAquick PCR Purification kit™ (QIAGEN) and used to 
transform E, coli strain XL 1 -Blue. Various dilutions of the transformation mixture was 
spread on LB/ampicillin plates and incubated overnight. In the morning, the number of 
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ampicillin resistant colonies were used to calculate the size of the library. Following this, 
single colonies were suspended in PCR mixture, and the DNA inserts were amplified. The 
size of the amplified product, determined by agarose gel electrophoresis, was used to 
determine the fraction of the library with full-size dAb insert. Diversity of the library was 
determined by sequencing 20 dAb genes fi-om the library. Growth of the library was 
performed as described (Harrison et al., 1996). 

Example 7 "Subcloning the library in the pahge vectorector 

As the initial step of sub-cloning, 1 80 pmol of the library phagemid DNA template and 100 
pmol of each of the two primers A6VH.ApalI, 

5'(CATGACCACAG1GCACAGGAGGTCCAGCTGCAGGAGTC)3* and A6VH.NotI 
S'fCGATTC TGCGGCCGCT GAGGAGACGGTGACCGTTG)3' were used in PCR to 
amplify the dAb genes. The primers are complimentary to the 5' and 3' ends of the dAb 
genes and incorporate Apall and Notl restriction sites (underlined sequences) at the end of the 
amplified genes. The amplified products were purified, cut sequentially with Apall and Notl 
restriction endonucleases, purified again, and hgated to the Apall/Notl-tic3icd fd-tet phage 
vector. Following this, 1.5 g of the desalted ligated product was mixed with 40 1 of 
competent £, coli strain TGI and the cells were transformed by electroporation. 
Transformation, library phage amplification and purification and library size determination 
were performed as in Example 4. 

Example 8 - Library size determination. 
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To determine the size of the library, immediately following the transformation and after the 
addition of the SOC medium an small aliquot of the electroporated cells were serially diluted 
in exponentially growing TGI cells. Two hundred 1 of the diluted cells was mixed with 3 
ml of 50 C agarose top and immediately poured onto 2xYT plates pre- warmed to 37 C. 
Plates were incubated overnight at 37°C and the number of plaques were used to deteraiine 
the size of the library. 

Example 9 - Panning 

Panning was performed using the Nunc-Immimo MaxiSorp™ 8-well strips (Nunc). Briefly, 
the wells were coated overnight by adding 150 1 o f 100 g/ m 1 antigen in PBS. In the 
morning, they were rinsed three times with PBS and subsequently blocked with 400 1 P B S - 
2% (w/v) skim milk (2% MPBS) at 37 C for 2 hr. The wells were rinsed as above and 10*^ 
transducing units phage in 2% MPBS were added. The mixture was incubated at room 
temperature for 1.5 hr after which the unbound phage in the supernatant was removed. The 
wells were rinsed 10 times with PBS-0.1% (w/v) Tween 20 and then 10 times with PBS to 
remove the detergent. The bound phage was eluted by adding freshly prepared 200 1 1 00 
mM triethylamine, pipetting the content of the well up and down several times and incubating 
the mixture at room temperature for 10 min. The eluted phage was transfered to a tube 
containing 100 1 1 M Tris-HCl, pH 7.4 and vortexed to neutralize triethylamine. Following 
this, 10 ml exponentially growing TGI culture was infected with 1 50 1 e luted phage by 
incubating the mixture at 37 C for 30 min. Serial dilutions of the infected cells were used to 
determine the titer of the eluted phage as described in the previous section. The remaining of 
the infected cells were spun down and then resuspend in 900 I 2 x YT. The cells were mixed 
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in 300 1 aliquots with 3 ml agarose top and the phage propagated on the plates overnight at 
37 ""C. In the morning the phage was purified, the titer was determined, and a total of 10*' 
transducing units phage were used for further rounds of selectio 

Example 10 - Expression and Purification. 

Thirty ml of LB containing 100 ug/ml ampicillin was inoculated with a single colony 
harboring pSJF2-dAb and the culture was shaken at 240 rpm at 37°C overnight. In the 
moming the entire overnight culture was used to inoculate 1 liter of M9 medium 
supplemented with 5 g/ml vitamin Bl, 0.4% casamino acid and 100 g/ml ampicillin. The 
culture was shaken at room temperature for 30 hr at 160 rpm and subsequently supplemented 
with 100 ml of lOx induction medium and 100 ul of IM isopropylthio— D-gal a ctoside. The 
culture was shaken for another 60 hr, the periplasmic firaction was extracted by osmotic shock 
method (Anand et al., 1991), and the presence of dAb in the extract was detected by Western 
blotting (MacKenzie 1994). The periplasmic firaction was dialyzed extensively in 10 mM 
HEPES (N-[2-hydroxyethyl]piperazine-N - [2-ethanesulfonic acid]) buffer pH 7.0, 500 mM 
NaCL The presence of the dAb C-terminal Hiss tag allowed a one step protein purification 
by immobilized metal affinity chromatography using HiTrap Chelating™ colunm 
(Phamacia). The 5-ml column was charged with Ni^^by applying 30 ml of a 5 mg/ml 
NiCl2.6H20 solution and subsequently washed with IS ml deionized water. Purification was 
carried out as described (MacKenzie, 1994) except that the starting buffer was 10 mM 
HEPES buffer, 10 mM imidazole, 500 mM NaCl, pH 7.0, and the bound protein was eluted 
with a 10-500 mM imidazole gradient. The purityof the protein was determined by SDS- 
PAGE (Laemmli). To detect the presence of dimer/multimer dAb in the protein preparation, 
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gel filtration chromatography was performed using Superdex75 (Pharmacia) as described 
(Dengetal., 1995). 

Example 11 - Alkvlation reactions 

Alkylation reactions were performed using iodoacetic acid. Briefly, 5x vol. cold acetone 
were added to 200 g of dAb solution and the contents were mixed, followed by 
centrifiigation in a microfuge at maximum speed at 4^C for 10 min. The pellet was dissolved 
in 500 1 of 6 M guanidinium hydrochloride and 55 1 of 1 M Tris buffer, pH 8.0 were 
added. Subsequently, a 25 molar excess of DTT, relative to Cys residues, was added and the 
mixture was incubated at room temperature for 30 min. To this, a 2.2 molar excess, relative 
to DTT, of freshly-made iodoacetic was added and the reaction was incubated as described 
above. At the end of incubation, the alkylated product was concentrated and dissolved in 50 
1 of distilled water using Ultrafree-MC 10,000 NMWL filter unit according to the 
manufacturer's instructions (Millipore, Nepean, ON, Canada). Control experiments were 
identical except that DTT was replaced with water. The MWs of the iodoacetic acid-treated 
dAbs were determined by mass spectroscopy. 

Example 12 - Surface Plasmon Resonance 

Binding studies were performed using BIACORE Upgrade (Biacore Inc., Piscataway, NJ) as 
described (Jonsson et al., 1991). Approximately 14, 000 RU of anti-FLAG M2 IgG or 
control IgG were immobilized on CM5 sensor chips by amine coupling. Single-domain 
antibodies were passed over the sensor chips surfaces in 10 mM HEPES buffer, pH 7.4, 1 50 
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mM NaCl, 3.4 mM EDTA, 0.005% P-20 (Biacore Inc.) at 25°C and at a flow rate of 5 jil/min. 
To assess the effect of DTT on the dAb binding to M2, dAbs were incubated with DTT prior 
to injection and the above buffer was supplemented with appropriate amount of DTT, 
Surfaces were regenerated with 10 mM HCl. Sensorgram data were analyzed using the 
BIAevaluation 3.0 software package (Biacore Inc.). 

Example 13 - Enzyme-Linked Immunosorbent Assay (ELISA^t 

Nunc-Immuno MaxiSorp™ plates (Nunc) were coated ovemight at A^C with 150 1 o f 10 
g/ml of 3B1 scFv or BSA in PBS. The contents were removed and the plates were tapped 
on a paper towel to remove any liquid remaining in the wells. The wells were blocked by 
adding 300 ^1 of 2% MPBS and incubating for 2 hr at 37°C. The contents of the wells were 
emptied as before, 100 1 of purified dAb phage in 2% MPBS was added, and the wells were 
incubated at room temperature for 1.5 hr. The contents were emptied again and the wells 
were washed 5 times with PBS-0.05% (v/v) Tween 20 and subsequently blotted on a paper 
towel to remove any remaining wash buffer. One Himdred 1 of recommended dilution of 
HRP/Anti-M13 monoclonal antibody conjugate (Amersham Pharmacia Biotech) in 2% 
MPBS was added and the wells were incubated at room temperature for 1 hr. The wells were 
washed six times as before and the binding of dAb to the antigen was detected 
colorimetrically by adding 100 1 of equal mixtures of TMB Peroxidase Substrate and H2O2 
(Kirkegaard and Perry Laboratories, Gaithersberg, MD, USA) at room temperature for 
several minutes. The reaction was stopped by adding 100 1 of 1 M H3PO4 and the A450 was 
measured by DYNATECH MR5000 plate reader (Dynatech Laboratories, Chantilly, VA, 
USA). 
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Example 14 - NMR studies 
Sample preparation 

Isotopically labeled proteins were prepared from cells grown on and/or *^C-enriched 
media (Bio-Express, Cambridge Isotopes Laboratory, Andover, MA), Briefly, six ml of LB 
containinglQO ug/ml ampicilin was inoculated with a single recombinant colony and 
incubs' ?d at 37**C and 260 rpm to an A6oo of about 5. The cells were centrifuged and then re- 
suspend in 3 ml sterile PBS and the A600 was measured. The cells were added to twenty- 
five ml of Bioexpress/100 ug/ml ampicilin in sterile 125 ml Erlenmyer flasks to a final 
concentration of A600=0.06 and Incubated at 3TC at 200 rpm for 9-10 hours. The 
periplasmic fraction was extracted by osmotic shock method (Anand, Dubuc, et aL. 1991) 
and the presence of dAb in the extract was detected by Westem blotting (MacKenzie, 
Sharma, et al.. 1994). The periplasmic fraction was dialyzed extensively in 10 mM HEPES 
buffer, pH 7.0, 500 mM NaCl. The presence of a C-terminal Hiss tag allowed a one step 
protein purification by IMAC using HiTrap Chelating^'^ column (Amersham Phamacia 
Biotech, Baie d'Urfe, QC, Canada). The 5-ml colunm was charged with Ni^*by applying 30 
ml of a 5 mg/ml NiCl2.6H20 solution and subsequently washed with 15 ml deionized water. 
Purification was carried out as described previously (MacKenzie, Sharma, et al.. 1994) except 
that the starting buffer was 10 mM HEPES buffer, 10 mM imidazole, 500 mM NaCl, pH 7.0, 
and the boxmd protein was eluted with a 10-500 mM imidazole gradient. The purity of the 
protein was determined by SDS-PAGE (Laenunli 1970). NMR samples were prepared by 
concentration and extensive buffer exchanging on a YMIO membrane (Amicon). The final 
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buffer contained 10 mM sodium phosphate, 150 mM NaCl, and 0.2 mM EDTA at pH 6.8. 
The final protein concentration of the NMR samples was ^ 1 mM. 

NMR spectroscopy 

NMR experiments were performed at 298 K and 308 K on a Bruker Avance800 spectrometer 
equipped with pulse field gradient accessories. 2D '^-'H HSQC (Bodenhausen and Ruben, 
1980) was acquired using solvent suppression via the WATERGATE method implemented 
through the 3-9-19 pulse train (Piotto et al., 1992; Sklenar et al., 1993). Triple-resonance 
experiments (Slatter et al.. 1999, and references therein) including HNCACB, 
CBCA(CO)NH, HNCA, HN(CO)CA, HNCO. HBHA(CO)NH, and ^^N-edited 3D NOESY- 
HSQC, 3D TOCSY-HSQC were acquired at 308 K and 298 K for R3A10 (cys-) and M2R2, 
respectively. The NMR data were processed using NMRPipe/NMRDraw (Delaglio et al.., 
1995) and analyzed by the use of NMRView (Johnson and Blevins, 1994). Chemical shifts 
were referenced internally to sodium 2,2-dimethyl-2-silapentane-5-sulphonate (DSS) for 
proton and calculated for and '^C assuming y'^/y'H = 0.1013291 18 and y^^C/y^H = 
0.251449530 (Wishart et al.., 1995). 

Example IS - Testing of the Phage Display A6 dAb Library Against the Anti-FLAG M2 
Monoclonal IgG Antibody 

The phage display A6-bas6d dAb Ubrary was panned against the anti-FLAG M2 monoclonal 
antibody as described by the New England Biolabs (Beverly, MA) (NEB Technical Bulletin 
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(1998): Ph.D.6 Phage Display Peptide Library Kits; Knappik and Piuckthun, 1994); the 
FLAG peptide epitope recognized by the M2 monoclonal antibody is (X)YKXXD where the 
first position has a preference for aspartic acid (Miceli et al., 1994). On a random basis, 
considering the length of the randomized region of A6 CDR3 (i.e., 20 residues), the 
consensus sequence should occur at a frequency of 4x10"^. Thus, in the A6 dAb library with 
2x10^ individual clones, the FLAG peptide epitope should be represented by approximately 
5x10^ independent clones. 

After three rounds of panning against M2 IgG thirty one clones from rounds two and three 
were selected and their A6 dAb genes sequenced. Twelve different A6 dAb genes with the 
FLAG consensus sequence were identified (Table 1, first twelve entries). 

Example 16 - Introducing Genetic Variation into the Sequence Corresponding to the A6 
Heavy Chain CDR3 Region - Randomized Residues 

Oligonucleotides comprising randomly mutated CpR3 regions were prepared on an Applied 
Biosystems 394 DNA synthesizer as described above, 

7. Production of 23 randomized residues (CDR3 1-23): 

The anti-codon formula [(A/C)NN] is used resulting in a reduction in possible codon usage 
from 64 to 32 and reduces the number of possible stop codons. Position one, therefore, 
comprises only A andC in the synthetic reaction mixture. For complete randomization of the 

62 



SUBSTITUTE SHEET (RULE 26) 



wo 01/18058 



PCT/CAOO/01027 



second and third positions of the codons the dNTP mixture comprise 25% each of A,G,C and 
T. 

The 3* oligonucleotide randomizing primer was designed such that the last IS nucleotides of 
framework 3 and the first 17 nucleotides of framework 4 were kept constant for 
hybridization. The nucleotides encoding the intervening amino acids, namely amino acids 1 - 
23 of the CDR3 region were randomized using the following primer: 
5* (GTTGTCCCTTGGCCCCA n[(A/C)NN]TTTCACACAGTAATA] 3' (Where 
n^23,antisense strand). 

Using a 50% A and 50% C for the first nucleotide position for each anti-codon triplet and 
25% each of A, C, G, and T for the second and third nucleotide positions for n=23, complete 
randomization of the 23 amino acids of the A6 CDR3 is achieved. 

2, Synthesis o f CPUs comprising 15-23 residues 

The primers are adapted by reducing n to 1 5-23 in the above primer formulae whilst keeping 
the flanking nucleotides constant. 

3. For synthesis of CDR3s comprising 24-33 residues 

1. The primers would be adapted by increasing n to 24-33 in the above primer formula while 

keeping the flanking nucleotides constant. 
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Parental Tyrosine 



To achieve approximately 50% homology to wild type at any one position in the A6 dAb 
CDR3 region during antisense synthesis using the DNA synthesizer, the following example 
would be used. In the case of tyrosine, which is encoded by TAC or TAT (antisense strand 
GTA or ATA) the nucleotides would be spiked as follows for the antisense strand. 

First anticodon nucleotide position: 80% of A and 20% of C is added to the dNTP 

solution, and G and T are not added to reduce codon degeneracy. 

Second anticodon nucleotide position: 80% T and approximately 6.67% of C, 6.67 of 

A and 6.67% of G. 

Third anticodon nucleotide position the mixture: 80% of A and approximately 6.67% 
of T and 6.67% of G and 6.7% C. 

The calculated probability of tyrosine would thus be 0.8x 0.8 x 0.8 x 100% = 51 .2 %, Thus 
^proximately 5 1% of the chains of the library will contain a wild-type A6 tyrosine in that 
specified position. 

Example 18 ■ Selective Randomization Biasing for 50% Homology to Parental Serine 

Using the same strategy in order to achieve approximately 50% homology to wild type serine 
at one or more positions, the following example is useful. 
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Using only A and/or C in the first anticodon position the amino acid serine could have two 
codons these are AGT, TCT and TCG (antisense ACT, AGA and CGA, respectively). The 
nucleotide spiking levels would be as follows: 

First anticodon nucleotide position: 50% A and 50% C. 

Second anticodon nucleotide position: 35.35% C, 35.35% G, 14.65% A and 14.65% T 
Third anticodon nucleotide position: 35.35% A, 35.35% T, 14.65% C and 14,65% G. 
The probability of producing serine for any given fragment, using this strategy is (1 x 
[0.3535+0,3535] x [0.3535+0.3535] x 100% = 50 %. Thus, approximately 50% of the chains 
will have a serine in the selected position. 

Example 19 - Selective Randomization Biasing for 50% Homology to Parental Serine 

To achieve approximately 10% homology to wild type at any one position in the A6 dAb 
CDR3 region during antisense synthesis using the DNA synthesizer, the following example 
can be used. In the case of tyrosine which is encoded by TAG or TAT (antisense strand GTA 
or ATA) the nucleotides would be spiked as follows for the anti sense strand. 

First anticodon nucleotide position: 47% of A and 53% of C is added; G and T are not 
added to reduce codon degeneracy. 

Second anticodon nucleotide position: 47% T and approximately 17.67 % of C, 17.67 
of A and 17.67% of G. 

Third anticodon nucleotide position: 47% of A and approximately 1 7.67% of T and 
17.67% of G and 17.67% C. 
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The calculated probability of tyrosine is thus 0.47x 0.47 x 0.47 x 100% = 10.4 %. Thus ^ 
approximately 10% of the chains of the library will contain a wild-type A6 tyrosine in that 
specified position. 

Example 20 - Selective Randomization Biasing for 50% Homology to Parental Serine 

To achieve approximately 90% homology to wild-type amino acids at any positions in the A6 
d.A.b CDR3 region during antisense synthesis using the DNA synthesizer , the following 
example would be used. In the case of tyrosine which is encoded by TAC or TAT (antisense 
strand GTA or ATA) the nucleotides would be spiked as follows: 

First anticodon nucleotide position: 97% of A and 3% of C is added, G and T are not 

added to reduce codon degeneracy. For this reason, only A and C are used in the first 

anticodon position for all 20 naturally occurring amino acids. 

Second antcodon nucleotide position: 97% T and approximately 1 % of C, 1% of A 

and l%ofG. 

Third anticodon nucleotide position: 97% of A and approximately 1% of T and 1% of 
Gandl%C. 

The calculated probability of tyrosine would be 0.97x 0.97 x 0.97 x 100% = 91.3 %. Thus 
approximately 90% of the chains of the library will contain a wild-type A6 tyrosine in that 
specified position. 

Using the approaches in the examples above, approximately 10 % to approximately 90 % of 
wild type amino acid representation at one or more specified amino acid residues in the A6 
CDR3 can be calculated and applied to the DNA synthesizer. 
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Example 21 



Removal of the reco mbination site 



Figure 15 shows a schematic representation of the steps taken to remove the putative 
recombination site at the 5' end of the A6VH gene. For simplicity only the part of the 
plasmid spanning from RP (1) primer binding site to FP (2) primer binding site and 
containing the A6VH gene is shown. 3=Chi.F primer; 4=Chi.R primer (explained below) 

The codons for amino acids 3-16 surrounding the recombination site were changed (Figure 
15). Briefly, using the Chi.R- 

5'(CAATTACAAGAAAGTGGTGGCGGACTGGTGCAACCAGGAGGATCCCTGAGAC 
TC)3'/FP and Chi.F-5'(ACTTTCTTGTAATTGGACCTCGGCCTGCGC)3'/RP primers 
pairs and pSJF-A6VH plasmid as template two 5' and 3' fragments were synthesised by PGR 
in a total volume of 50 ul containing 10 pmol each of the two primers, 2 mM each of the four 
dNTPs, Ix buffer and 2.5 units of AmpliTaq™ DNA polymerase (Perkin Ehner). The PGR 
protocol consisted of an initial denaturation step at 94^ for 3 min followed by 30 cycles of 
94»C for 30 sec, 55°C for 30 sec, 72°C for 1 min and a final extension step at 72°C for 10 
min. The two fragments were gel purified using the QIAquick Gel Extraction^" kit 
(QIAGEN), and a larger construct was assembled from the 5' and 3' fragments by performing 
^lice overlap extension (SOE). Briefly, the reaction vial containing both 5' and 3' fragments, 
200 jiM each of the four dNTPs, 5 ^l lOX buffer (NEB), and 2 units of Vent DNA 
polymerase (NEB) was subjected to 7 cycles of 1 min at 94°C and 2.5 min at 72°C. To 
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amplify the assembled construct, RP and FP primers were added at a final concentration of 1 
pmol/fil and the mixture was subjected to 30 cycles of 1 min at 94**C, 30 s at SS^'C, and 1 min 
at IT'C The amplified product was purified (QIAquick PCR Purification™ kit) and 
subsequent sequencing revealed that the desired mutations had been incorporated into the 
VH. 

The present invention may be embodied in other specific forms without departing from the 
spirit or essential characteristics thereof. Certain adaptations and modifications of the 
invention will be obvious to those skilled in the art. Therefore, the presently discussed 
embodiments are considered to be illustrative and not restrictive. It is understood that the 
claims may refer to aspects or embodiments of the invention that are only inferentially 
referred to in the disclosure. 
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1 . A combinatorial library comprising variants of a parental ligand binding molecule, 
wherein said parental ligand binding molecule comprises an immunoglobulin Vh 
fragment comprising at least in substantial part, at least the FR regions of the 
immunoglobulin domain depicted in one of Figures 1 or 2 and wherein said 
variants comprise, at least in substantial part, at least the FR regions of the 
immimoglobulin domain depicted in one of Figures 1 or 2 and differ from said 
parental ligand binding molecule at amino acid residues constituting at least part of at 
least one of the CDRs of said parental ligand binding molecule. 

2. A library according to claim 1, wherein said parental ligand binding molecule is a 
substantially intact four chain antibody or a binding fragment thereof including an Fd 
fragment, an Fab fragment, an Fabc fragment, a F(ab')2 fragment, F(ab)2 fragment, a 
single chain V region fragment (scFv), or a fusion polypeptide, wherein the fusion 
polypeptide comprises any such parental ligand binding molecule fused to another 
polypeptide. 

3. A library according to claim 1 , wherein said parental ligand binding molecule is a 
dAb. 

4. A library according to claim 1, having a substantial representation of variants which 
have a CDRS that is 16 to 33 amino acids in length. 

5. A library according to claim 4, wherein substantially all of said variants have a CDR3 
that is the same length. 

6- A library according to claim 4, wherein said variants have CDR3s which vary in 
length. 

7. A library according to claim 5 or 6, wherein a substantial proportion of said variants 
have a CDR3 that is 18 to 28 amino acids in length. 
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8. A library according to claim 5 or 6, wherein a substantial proportion of said variants 
have a CDR3 that is 20 to 25 amino acids in length. 

9. A library according to claim 5, wherein a substantial proportion of said variants have 
a CDR3 that is 23 amino acids in length. 

10. A library according to claim 4, wherein said variants vary from said parental ligand 
binding molecule in an amino acids constituting at least part of the CDR3. 

11. A library according to claim 10, wherein said parental ligand binding molecule 
comprises an immunoglobulin binding fragment comprising, at least in substantial 
part, the CDR3 region of the immunoglobulin domain depicted in Figure 1 . 

12. A library according to claim 4, whereiii said parental ligand binding molecule 
comprises an immunoglobulin Vh binding fragment comprising, at least in substantial 
part, the CDR regions of the immimoglobulin Vh domain depicted in Figure 1. 

13. A library according to claim 4, wherein said variants comprise the same FR regions as 
said parental binding molecule. 

14. A phage display library according to claim 4, wherein said parental ligand binding 
molecule comprises the entire FR regions of the immunoglobulin domain depicted 
in one of Figures 1 and 2. 

15. A library according to claim 1, wherein said parental ligand binding molecule 
comprises at least in substantial part the FR2 region of the inmiunoglobulin Vh 
domain depicted in Figure 1, including residues 44, 45 and 47, and wherein the FR2 
regions is at least partially randomized to generate variants having one or more 
hydrophilic amino acids at VH-VL interface. 
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16. A library according to claim 4, wherein said variants vary from said parental ligand 
binding molecule at amino acids which are proximal to the carboxy terminus of the 
CDR3. 

17. A library according to claim 12, wherein said variants vary from said parental ligand 
binding molecule in an amino acids which are immediately upstream of position 
lOOo. 

18. A library according to claim 4, wherein said variants vary from said parental ligand 
binding molecule in an amino acids lOOi to lOOn identified in SEQ. ID. NOS.: 1, 2 or 
3. 

19. A library according to claim 4, wherein said parental ligand binding molecule is 
derived from a human Vh domain identified in Figure 1 or is built on any firamework 
which is at least 80% homologous (preferably 85% homologous, more preferably at 
least 90% homologous) to the fi:amework and other conserved regions of said human 
V„ domain. 

20. A library according to any claim 4 or 19, wherein said parental ligand binding 
molecule is built on a framework which is at least 80% homologous (preferably 
85% homologous, more preferably at least 90% homologous) to the framework 
regions and conserved regions of a human domain. 

2L A library according to claim 4 or 20, wherein said parental ligand binding molecule is 
built on a V|| framework which is at least 80% homologous (preferably 85% 
homologous, more preferably at least 90% homologous) to the firamework regions and 
conserved regions of a human domain derived from a IgM. 

22. A library according to claim 19, 20 or 21, wherein said parental ligand binding 

molecule is encoded by a nucleic acid sequence comprising nucleic acid residues 6*48 
as shown in Figure 3. 
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23. A library according to claiin 19, 20 or 21, wherein said parental ligand binding 
molecule is encoded by the nucleic acid sequence depicted in Figures 3 or 4. 

24. A library according to claim 1, wherein said parental ligand binding molecule 
comprises at least in substantial part the FR2 region of the immunoglobulin 
domain depicted in Figure 1, including amino acid residues 44, 45 and 47. 

25. A library according to claim 15 or 24, wherein one or more residues selected from 
residues 4 to 21 in FRl are partially randomized. 

26. A library according to claim 15, 24 or 25, wherein one or more residues selected from 
residues lOOo to 1 14 in FR4 are partially randomized. 

27. A library according to claim 1 5, 25 or 26, wherein the residues selected for partial 
randomization lOOo to 114 in FR4 are randomized at least 75% and preferably 90% in 
favour of the residues depicted in Figure 1. 

28. A library according to any of the preceding claims which is a phage display library. 

29. A phage display library displaying a plurality of different variants of a parental ligand 
binding molecule, wherein said parental ligand binding molecule comprises an 
immunoglobulin binding fragment comprising, at least in substantial part, at least 
the FR regions of the immvmoglobulin Vh fragment depicted in one of Figures 1 or 2 
and wherein said variants are encoded by nucleic acid sequences which vary from the 
nucleic acid sequence encoding said parental ligand binding molecule in a 
subsequence encoding at least part of one of the CDRs of said parental Ugand binding 
molecule, whereby said plurality of variants comprise at least in substantial part, the 
FR regions of the immunoglobulin Vh fragment depicted in such Figure 1 or 2 and are 
differentiated, at least in part, by amino acid variations encoded by variations in said 
subsequence. 
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30. A heterogeneous population of genetic packages (eg. phage) having a genetically 
determined outer surface protein, wherein the genetic packages collectively display a 
plurality of different, preferably human, (ie. substantial identity to human) V„ 
ligand-binding fragments, each genetic package including a nucleic acid construct 
coding for a fusion protein which comprises at least a portion of the outer surface 
protein and a variant of at least one soluble parental ligand-binding fragment 
preferably derived from or having a substantial part of the FR regions of the amino 
acid sequence identified in one of Figures 1 or 2, (or a sequence at least 80%, 
preferably 85 to 100%, more preferably 90-100%, homologous (ie. identity) thereto), 
wherein the Vh binding-fragment preferably spans from a position upstream of an 
immunoglobulin heavy chain CDRl to a position downstream of CDR3 (preferably 
including substantially all of FRl and/or FR4), and wherein at least part of a CDR, 
preferably the CDR3, is a randomly generated variant of a CDR of said parental V„ 
ligand binding-fragment and wherein the fusion protein is preferably expressed in the 
absence of an immunoglobulin light chain whereby the potential Vh binding 
fragments are, on the whole, better adapted to be or better capable of being expressed 
as soluble proteins. 

31. A ligand binding molecule which is a variant of a parental ligand binding molecule 
which comprises an immunoglobulin Vh binding domain, said parental binding 
molecule comprising, at least in substantial part, at least the FR regions of the 
immunoglobuUn Vh fragment depicted in one of Figures 1 or 2 and wherein said 
variant comprises, at least in substantial part, the FR regions of the immunoglobulin 
Vh fragment depicted in the corresponding such Figure 1 or 2 and differs from said 
parental ligand binding molecule at amino acid residues constituting at least part of 
one of at least one of the CDRs of said parental hgand binding molecule. 

32. A hgand binding molecule which is derived from a variant of a parental ligand 
binding molecule which comprises an immunoglobulin Vh binding domain, said 
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parental binding molecule comprising, at least in substantial part, at least the FR 
regions of the immunoglobulin Vh domain depicted in one of Figures 1 or 2 and 
wherein said variant comprises, at least in substantial part, the FR regions of the 
immunoglobulin V|, fragment depicted in the corresponding such Figure and differs 
from said parental ligand binding molecule at amino acid residues constituting part of 
one of the CDRs of said parental ligand binding molecule. 

33. A ligand binding molecule which has been identified as binding to a target ligand by 
screening a library according to claims 1 to 24 for one or more ligand binding 
molecules which specifically recognize said target ligand. 

34. . A combinatorial library comprising variants of a parental ligand binding molecule, 

wherein said parental ligand binding molecule comprises an immunoglobulin 
fiagment comprising at least in substantial part, at least the FR regions of the 
immunoglobulin Vh domain depicted in Figure 1 and wherein said variants comprise, 
at least in substantial part, at least the FR regions of the immunoglobulin domain 
depicted in Figure 1 and differ from said parental ligand binding molecule at amino 
acid residues constituting part of at least one of the CDRs of said parental ligand 
binding molecule. 

35. A Ubrary according to claim 34, wherein at least a substantial number of said variants 
comprise at least one of the following mutations: 

G44E 
L45R 
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Y47G 
V93A 
K94A 

36. A library according to claim 34, wherein at least a substantial number of said variants 
comprise the following mutations: 

G44E 
L45R 
Y47G 

37. A library according to claim 34, wherein at a substantial number of said variants 
comprise the following mutations: 

G44E 
L45R 
Y47G 
V93A 
K94A 

38. A library comprising a heterogeneous population of genetic packages which 
collectively display a plurality of different potential Vh binding fragments, each said 
genetic package having: 

(a) an outer sur&ce having an outer surface protein; and 

(b) a nucleic acid construct coding for a fusion protein, said fusion protein 
including: 
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(i) at least a portion of said outer surface protein; and 

(ii) a Vh binding-fragment spanning from a position upstream of an 
immunoglobulin heavy chain CDRl to a position downstream of CDR3, 
wherein at least part of said CDR3 is a randomly generated variant of a CDR3 
of a non-camelid or a non-camelid type parental V„ binding-fragment; and 

wherein said fusion proteins are expressed in the absence of an immunoglobulin light 
chain protein or portions thereof on said outer surface of said genetic packages, and 
wherein said potential V„ binding fragments are adapted to be or capable of being 
expressed as soluble proteins. 

39. A library as claimed in claim 38, wherein said potential binding fragments have a 
CDR3 length of 16 to 33 amino acids. 

40. A library as claimed in any one of claims 38 or 39, wherein said binding-fragment 
comprises fragments FRl to FR4. 

41. A library as claimed in any one of claims 38-40, wherein each said genetic packages 
is a phage and said library is a phage display library. 

42. A library as claimed in claim 41 , wherein said binding fragments comprise 
fragments FRl to FR4. 
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43. A library as claimed in claim 39, wherein CDR3s of a variety of different lengths 
from 16 to 33 amino acids are predominantly represented in said potential binding 
fragments. 

44. A library as claimed in claim 43, wherein CDR3s of a variety of different lengths 
from 17 to 23 amino acids are predominantly represented in said potential V„ binding 
fragments. 

45. A library as claimed in claim 44, wherein CDR3s of 23 amino acids in length are 
predominantly represented in said potential Vh binding fragments. 

46. A library as claimed in any one of claims 38-45, wherein said potential V„ binding- 
fragment is built on a Vh framework which is at least 80% homologous to the 
framework regions of human V^. 

47. A library as claimed any one of claims 38-46, wherein said parental V„ binding- 
fragment is derived from a human V„ chain identified in Figure 1 or is built on any 
framework which is at least 80%homologous to the framework and other conserved 
regions of said human V|, chain. 

48. A library as claimed in claim 38, wherein said parental binding-fragment is 
adapted or adaptable to a human framework. 
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49. A library as claimed in any one of claims 38-48, wherein the amino acids in one or 
more series of CDR3 amino acids selected from the groups of amino acids consisting 
of 95-100 and lOOi-lOOn are preserved in approximately at least 90% or 
approximately 100% of said potential Vh binding fragments. 

50. A library as claimed in any one of claims 38-49, wherein one or more amino acids in 
one or more series of CDR3 amino acids selected from the groups of amino acids 
consisting of 95-100, lOOi-lOOn, 100o-102 and 101-102 of Figure 4are preserved, on 
an amino acid by amino acid basis, in approximately at least 90% or approximately 
100% of said potential Vh binding fragments. 

51. A library as claimed in any one of claims 38-50, wherein said potential binding 
fragments have a native human Vl interface at positions 44, 45, and 47 of Figure 1. 

52. A library as claimed in any one of claims 38-51, wherein said potential Vj, binding 
fragments have non-hydrophobic amino acids at least one of positions 44, 45, and 47 
of Figure 1. 

53. A library as clainied in any one of claims 38-52, wherein said potential Vj, binding 
fragments are fruther characterized by a CDR3 containing an amino acid sequence 
which is at least 90% homologous to at least one region of conserved amino acids 
selected from those regions identified in Figure 1. 

54. A library as claimed in any one of claims 38-53, wherein said potential V„ binding 
fragments are fiirthe r characterized in that at least approximately 50% of the amino 
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acids corresponding to the amino acids at positions 100a- 1 OOh shown in Figiire 1 are 
biased in favor of wild-type A6 to produce at least 10% wild-type amino acid at said 
positions in said potential V„ binding fragments. 

55. A library as claimed in claim 54, wherein said potential Vh binding fragments are 
furthe r characterized in that at least approximately 90% of the amino acids 
corresponding to the amino acids at positions 1 00a- 1 OOh shown in Figure 1 are each 
10% biased in favor of wild-type A6 to produce at least 10% wild-type amino acid at 
said positions in said potential Vh binding fragments. 

56. A library as claimed in any one of claims 38-55, wherein one or more individual 
amino acids in positions lOOa-lOOb and lOOg-lOOh, or lOOa-lOOc and lOOf-lOOh, are 
wild-type in at least approximately 10% of said potential Vh binding fragments. 

57. A library as claimed in claim 56, wherein individual amino acids in positions 100a- 
100b and lOOg-lOOh, or lOOa-lOOc and lOOf-lOOh, are wild-type in at least 
approximately 50% of said potential Vh binding fragments. 

58. A library as claimed in any one of claims 38-57, wherein at least 50% of individual 
amino acids in positions 95-100 are biased in favor of wild type to produce at least 
10% wild-type amino acid at said positions in said potential Vh binding fragments. 

59. A library as claimed in any one of claims 38-58, wherein at least 90% of the 
individual amino acids in positions 95-100 of Figure 1 are biased in favor of wild-type 
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to produce at least 10% wild-type amino acid at said positions in said potential 
binding fragments. 



60. A library as claimed in any one of claims 38-59, wherein at least 50% of the 

individual amino acids in positions lOOi - lOO/i in Figure 1 are biased in favor of wild- 
type to produce at least 10% wild-type amino acid at said positions in said potential 
V„ binding fragments. 

A library as claimed in claim 60, wherein at least 50% of the individual amino acids 
in positions 100* - lOOn in Figure 1 are biased in favor of wild-type to produce at least 
50% wild-type amino acid at said positions in said potential Vh binding fragments. 

62. A library as claimed in any one of claims 38-61, wherein individual amino acids in 
any one or more of positions lOOa-lOOb, lOOg-lOOh, 1001 and lOOo are biased to 
produce at least 10% of wild-type amino acids, aromatic amino acids or amino acids 
selected exclusively from the group consisting of tyrosine, histidine, glutamine, 
asparagine, lysine, aspartic acid and glutamic acid, wild-type amino acid at said 
positions in said potential V^, binding fragments. 

63. A library as claimed in any one of claims 38-62, wherein amino acids in any one or 
more of positions lOOa-lOOb, lOOg-lOOh, 1001 and lOOo are biased to produce at least 
50% of wild-type amino acids, aromatic amino acids or amino acids selected 
exclusively from the group consisting of tyrosine, histidine, glutamine,. asparagine, 
lysine, aspartic acid and glutamic acid, at said positions in said potential V„ binding 
fragments. 
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64. A library as claimed in any one of claims 38-63, wherein at least 5 consecutive amino 
acid positions among positions 95-lOOn shown in Figure 1 are biased to produce at 
least 10% wild-type amino acids at said positions of said potential V„ binding 
fragments. 

65. A library as claimed in any one of claims 38-64, wherein at least 8 consecutive amino 
acids positions among residues 95-lOOn shown in Figure 1 are biased to produce at 
least 10% wild-type amino acids at said positions of said potential binding 
fragments. 

66. A library as claimed in any one of claims 38-65, wherein at least 10 consecutive 
amino acids among residues 95-lOOn shown in Figure 1 are biased to produce at least 
10% wild-type amino acids at said positions of said potential binding fragments. 

57. A library as claimed in any one of claims 38-66, wherein at least amino acids 
positions lOOa-lOOb to lOOf-lOOh and 100m are biased to produce at least 50% wild- 
type amino acids at said positions of said potential Vh binding fragments. 

58. A library as claimed in any one of claims 38-67, wherein at least amino acids 
positions lOOf to 100m are biased to produce at least 50% wild-type amino acids at 
said positions of said potential Vj, binding fragments. 
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69. A library as claimed in any one of claims 38-68, wherein at least amino acids 
positions lOOo to 102 or 101 to 102 are biased to produce at least 10% wild-type 
amino acids at said positions of said potential binding fragments. 

70. A library as claimed in any one of claims 38-69, wherein framework regions are at 
least approximately 90% homologous to that of the wild-type parental binding- 
fragment shown in Figure 1. 

71. A library as claimed in any one of claims 38-70, wherein the CDR2 region is at least 
approximately 80% homologous to that of the wild-type parental binding-fragment 
shown in Figure 1. 

72. A library as claimed in any one of claims 38-71, wherein the CDRl region is at least 
approximately 80% homologous to that of the wild-type parental binding-fragment 
shown in Figure 1. 

73. A library as claimed in any one of claims 38-72, wherein the CDRl region is biased 
to have a cysteine residue for forming a loop in said binding fragment by means 
of interaction of said cysteine with a randomly generated cysteine residue in CDR3. 

74. A library as claimed in any one of claims 38-73, wherein said recombinant phage are 
constructed in an M-13 derived vector and said phage coat protein is pin. 
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75. A library comprising a heterogeneous population of genetic packages which 
collectively display a plurality of different potential binding fragments, each said 
genetic package having: 

(a) an outer surface having an outer surface protein; and 

(b) a nucleic acid constmct coding for a fusion protein, said fusion protein 
including: 

(i) at least a portion of said outer surface protein; and 

(ii) a randomly generated variant of a non-camelid or a non*camelid 
type parental binding fragment; 

wherein at least a part of said construct is biased in favor of producing said fusion 
proteins which are expressed as soluble proteins. 

76. A library comprising a heterogeneous population of genetic packages which 
collectively display a plurality of different potential binding fragments, each said 
genetic package having: 

(a) an outer surface having an outer surface protein; and 

(b) a nucleic acid construct coding for a fusion protein, said fusion protein 
including: 

(i) at least a portion of said outer surface protein; and 

(ii) a randomly generated variant of a non-camelid or a non-camelid 
type parental binding fragment; 

wherein at least a part of said constmct is biased in favor of producing said fusion 
proteins having the amino acid constmct of said parental binding fragment. 
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77. A library as claimed in claim 76, wherein said construct is biased in favor of 
producing soluble fusion proteins. 



78. A library as claimed in claim 77, wherein said parental binding fragment is a 
binding fragment, and said construct either includes at least a portion of amino acids 
95 to lOOo of Figure 1. 

79. A library as claimed in claim 78, wherein said parental binding fragment is a V„ 
binding fragment, and said constmct either includes at least a portion of amino acids 
ofCDR3. 

80. A library as claimed in any one of claims 75-77, wherein said genetic package is a 
phage and said soluble parental binding-fragment is selected from the group 
consisting of an scFv, Fab, V^, Fd, Fabc, F{ab')2, F(ab)2 derived from A6. 

81. A library as claimed in any one of claims 75-80, further comprising a plurality of 
libraries which are pooled, wherein at least a first and a second of said pooled libraries 
differ in the degree of biasing to wild-type amino acids. 

82. A library as claimed in claim 81, wherein said first and said second pooled libraries 
differ with respect to the degree of biasing of CDR3 region to produce fusion proteins 
with differing solubility characteristics. 

83. A library as claimed in claim 82, wherein said first and said second pooled libraries 
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differ with respect to the degree of biasing to produce amino acid that are preferred 
for intermolecular interaction, said amino acids selected from a group including 
tyrosine, histidine, glutamine, asparagine, lysine, aspartic acids aind glutamic acid. 



84. A method for creating a library of soluble proteins expressing heavy chain binding 
domains comprising generating a library of microorganism clones producing variant 
protein heavy chain binding domains by incorporating mutations into the binding 
subunit DNA of a non-camelid parental heavy chain binding domain in said 
microorganism clones. 

85. A method for creating a library expressing binding domains comprising: 

(a) cloning a parental DNA sequence encoding a parental domain to create 
parental clones; 

(b) replacing a variable region of said parental clones with a variant DNA 
sequence by adding by a series of step- wise in vitro syntheses variant nucleic acids to 
positions on said parental clone, said variant nucleic acids corresponding to positions 
of parental nucleic acids, to create a variant DNA sequence; and 

(c) generating a library of genetic packages each having a surface and a 
surface protein expressed on said surface, said surface protein including a variant 
protein binding domain expressed by said variant DNA sequence; 

wherein at step (b) said variant nucleic acids are added from a series of discrete pools 
of nucleic acids, and at least one of said pools is biased in favor of selecting a nucleic 
acid of the corresponding position of the parental nucleic acid.. 
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86. A method as claimed in claim 85, wherein said at least one pool of nucleic acids is 
biased in favor of selecting said corresponding parental nucleic acid by preparing a 
dNTP solution having an excess of said corresponding parental nucleic acid as 
compared to other nucleic acids. 

87. A method as claimed in claim 85, wherein said library is a phage library. 

88. A method as claimed in claim 85, wherein said binding domain is an immunoglobulin 
binding domain. 
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1/22 
Figure 1 



^taructuT-g^ of Vw domain of human A6 antibody. 

1 2 3 4 5 6 7 8 9 10 11 12 13 14 
GAG GTC CAG CTG CAG GAG TCT GGG GGA GGC TTA GTC GAG OCT 
EVQLQE SGGGLVQP 

15 16 17 18 19 20 21 22 23 24 25 26 27 28 
GGG GGG TCC CTG AGA CTC TCC TGT TCA GCC TCT GGA TTC ACC 
GGSLRLSCSASGFT 



29 30 31 32 33 34 35 36 37 38 39 40 41 42 
TTC AGT AGC TAT GCT ATG CAC TGG GTC CGC CAG GCT CCA GGG 
F S S Y A M H W V R Q A P G 

CDRl 

43 44 45 46 47 48 49 50 51 52 a 53 54 55 
AAG GGA CTG GAA TAT GTT TCA GCT ATT AGT AGT AAT GGG GGT 

K G L E Y V S A I S S N Q G_ 

CDR2 

56 57 58 59 60 61 62 63 64 65 66 67 68 69 
AGC ACA TAC TAC GCA GAC TCC GTG AAG GGC AGA TTC ACC ATC 
STYYAPgVKG R F T I 



70 71 72 73 74 75 76 77 78 79 80 81 82 a 
TCC AGA GAC AAT TCC AAG AAC ACT CTG TAT CTT CAA ATG AGC 
SRDNSKNTLYLQMS 



b c 83 84 85 86 87 88 89 90 91 92 93 94 
AGT CTG AGA GCT GAG GAC ACG GCT GTG TAT TAC TGT GTG AAA 
SLRAEDTAVYYGVK 
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2/22 

PigiiT-f» 1 ( f;r>nt-inued> 

95 96 97 98 99 100 abcdefgh 
GAC AGG TTA AAA GTG GAG TAG TAT GAT AGT AGT GGT TAT TAG 
DRLKVEYYDSS G Y Y 

CDR3 

i j k 1 m n o 101 102 103 104 105 105 107 
GTT TCT CGG TTC GGT GOT TTT GAT ATC TGG GGC CAA GGG ACA 
VSRFQA FDI W G Q G T 



108 109 11.0 111 112 113 
ACG GTC ACC GTC TCA TCA 
T V T V S S 
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3/22 
Figure 2 

Structure of modified Vh domain of human A6 antibody- 
showing substitutions at position 44/ 45, 47, 93 amd 
•94. The NheX site is underlined. 

1 2 3 4 5 6 7 8 9 10 11 12 13 14 
GAG GTC GAG CTG GAG GAG TCT GGG GGA GGC TTA GTC GAG CCT 
EVQLQESGGGLVQP 

15 16 17 18 19 20 21 22 23 24 25 26 27 28 
GGG GGG TCC CTG AGA GTC TCC TGT TCA GCT AGC GGA TTC ACC 
GG SL RLSCSASGFT 



29 30 31 32 33 34 35 36 37 38 39 40 41 42 
TTC AGT AGC TAT GCT ATG CAC TGG GTC CGC CAG GCT CCA GGG 
F S S Y A M H W V R Q A P G 

CDRl 

43 44 45 46 47 48 49 50 51 52 a 53 54 55 
AAG GAA CGT GAA GGT GTT TCA GCT ATT AGT AGT AAT GGG GGT 
K E R E 6 V S A I S S N G G 

CDR2 

56 57 58 59 60 61 62 63 64 65 66 67 68 69 
AGC ACA TAG TAC GCA GAC TCC GTG AAG GGC AGA TTC ACC ATC 
S TYYADS VKG R F T I 



70 71 72 73 74 75 76 77 78 79 80 81 82 a 
TCC AGA GAC AAT TCC AAG AAC ACT CTG TAT CTT CAA ATG AGC 
SRDNSKNTLYL. QMS 



be 83 84 85 86 87 88 89 90 91 92 93 94 
AGT CTG AGA GCT GAG GAG ACG GCT GTG TAT TAC TGT GCA GCA 
SLRAEDTAVYYCAA 
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Figure 2 (continued) 



95 96 97 98 99 100 abcdefgh 
GAC AGG TTA AAA GTG GAG TAG TAT GAT AGT AGT GGT TAT TAG 
DRI.KVEYYDSSGYY 

CDR3 

i j k 1 m n o 101 102 103 104 105 105 107 
GTT TCT CGG TTC GGT GCT TTT GAT ATC TGG GGC CAA GGG ACA 
VS RFG AFD I W . G Q G T 



108 109 110 111 112 113 
ACG GTC ACC GTC TCA TCA 
T V T V S S 
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Fioure 3 



Structure of Vh domain of hxunan A6 antibody. The 
mutated nucleotides spanning residues 7-48 to remove 
the recombination site are in bold and underlined. 

1 2 3 4 5 6 7 8 9 10 .11 12 13 14 
GAG GTC CAA TTA HAG GAA AGT GGT dGC GGA CTG GTG CAA CCA 
EVQLQESGGGLVQP 

15 16 17 18 19 20 21 22 23 24 25 26 27 28 
GGA GGA TCC CTG AGA CTC TCC TGT TCA GCC TCT GGA TTC ACC 
GGSLRLSCSASGFT 



29 30 31 32 33 34 35 36 37 38 39 40 41 42 
TTC AGT AGC TAT GCT ATG CAC TGG GTC CGC CAG GCT CCA GGG 
F S S Y A M H W V R Q A P . G 

CDRl 

43 44 45 46 47 48 49 50 51 . 52 a 53 54 55 
AAG GGA CTG GAA TAT GTT TCA GCT ATT AGT AGT AAT GGG GGT 

K G L E Y V S A I S S N S S_ 

CDR2 

56 57 58 59 60 61 62 63 64 65 66 67 68 69 
AGC ACA TAC TAC GCA GAC TCC GTG AAG GGC AGA TTC ACC ATC 
STYYADSVKG R F T I 



70 71 72 73 74 75 76 77 78 79 80 81 82 a 
TCC AGA GAC AAT TCC AAG AAC ACT CTG TAT CTT CAA ATG AGC 
SRDNSKNTLYLQMS 



b c . 83 84 85 86 87 88 89 90 91 92 93 94 
AGT CTG AGA GCT GAG GAC ACG GCT GTG TAT TAC . TGT GTG AAA 
SLRAEDTAVYYCVK 
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Figure f continued) 



95 96 97 98 99 100 a b e d e f g h 
GAC AGG TTA AAA GTG GAG TAG TAT GAT AGT AGT GGT TAT TAG 

DRLKVEYYDS SG Y Y 

CDR3 

i j k 1 m n o 101 102 103 104 105 105 107 

GTT TCT CGG TTC GGT GOT TTT GAT ATC TGG GGC CAA GGG ACA 
VSRFGAFDI W G Q G T 



108 109 

ACG GTC 
T V 



110 111 

ACC GTC 
T V 



112 113 
TCA TCA 
S S 
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Figure 4 

Structure of modified Vh domain of hximem A6 antibody 
showing sxibstitutions at position 44, 45, 47, 93 and 
94. The mutated nucleotides spanning residues 7-48 to 
remove the recombination site as well as the NheX site 
are in bold and underlined. 

1 2 3 4 5 6 7 8 9 10 11 12 13 14 
GAG GTC CAA TTA CAG GAA AGT GOT GGC GGA CTG GTG CAA CCA 
EVQLQE SGGGLVQP 

15 16 17 18 19 20 21 22 23 24 25 26 27 28 
GGA GGA TCC CTG AGA CTC TCC TGT TCA GCT AGC GGA TTC ACC 
GGSLRLSCSASGFT 



29 30 31 32 33 34 35 36 37 38 39 40 41 42 
TTC AGT AGC TAT GCT ATG CAC TGG GTC CGC CAG GCT CCA GGG 
F S S Y A M H W V R Q A . P G 

CDRl 

43 44 45 46 47 48 49 50 51 52 a 53 54 55 
AAG GAA CGT GAA GGT GTT TCA GCT ATT AGT AGT AAT GGG GGT 
K E R E G V S A I S S N G G 

CDR2 

56 57 58 59 60 61 62 63 64 65 66 67 68 69 
AGC ACA TAC TAC GCA GAC TCC GTG AAG GGC AGA TTC ACC ATC 
ST YYADSVKG R F T I 



70 71 72 73 74 75 76 77 78 79 80 81 82 a 
TCC AGA GAC AAT TCC AAG AAC ACT CTG TAT CTT CAA ATG AGC 
SRDNSKNTLYLQMS 
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Figure 4 (continued) 



be 83 84 85 86 87 88 89 90. 91 92 93 94 
AGT CTG AGA GCT GAG GAC ACG GCT GTG TAT TAG TGT GCA GCA 
SLRAE DTAVYYCAA 



95 96 97 . 98 99. 100 a b C d e f g h 

GAC AGG TTA AAA GTG GAG TAG TAT GAT AGT AGT GGT TAT TAG 

D R L K V E Y Y D S S G Y Y 

CDR3 

i j k 1 m n O 101 102 103 104 105 105 107 

GTT TCT CGG TTC GGT GCT TTT GAT ATC TGG GGC CAA GGG AGA 

VSRFGAFDI W G Q G T 



108 109 110 111 112 113 
ACG GTC ACC GTC TCA TCA 
T V T. V S S 
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Ficmre 5 
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Figure 6 
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Figure 7 
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Figure 8 A 
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Figure 8B 
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Figure 9 
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Figure 10 

Structure of modified Vh domain of human A6 antibody- 
showing substitutions at position 33, 44, 45, 47, 93, 
94 and lOOe. 

1 2 3 4 5 6 7 8 9 10 11 12 13 14 
GAG GTC GAG CTG GAG GAG TCT GGG GGA GGC TTA GTC GAG CCT 
EVQLQESGGGLVQP 

15 16 17 18 19 20 21 22 23 24 25 26 27 28 
GGG GGG TCC CTG AGA GTC TCC TGT TCA GCC TCT GGA TTC ACC 
GGSLRLSCSASGFT 



29 30 31 32 33 34 35 36 37 38 39 40 41 42 
TTC AGT AGC TAT TGT ATG CAC TGG GTC CGC CAG GCT CCA GGG 
F S S Y C M H W V R Q A P G 

CDRl 

43 44 45 46 47 48 49 50 51 52 a 53 54 55 
AAG GAA CGT GAA GGT GTT TCA GCT ATT AGT AGT AAT GGG. GGT 
K E R E G V S A I S S N G G 

CDR2 

56 57 58 59 60 61 62 63 64 65 66 67 68 69 
AGC ACA TAG TAG GCA GAG TCC GTG AAG GGC AGA TTC ACC ATG 
STYYADSVKG R F T I 



70 71 72 73 74 75 76 77 78 79 80 81 82 a 
TCC AGA GAG AAT TCC AAG AAG ACT CTG TAT GTT GAA ATG AGC 
SRDNSKN TLYLQMS 



b C 83 84 85 86 87 88 89 90 91 92 93 94 
AGT CTG AGA GCT GAG GAC ACG GCT GTG TAT TAG TGT GCA GCA 
SLRAEDTAVYYCAA 
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Figure 1 0 f continued) 



95 96 97 98 99 100 abcdefgh 
GAC AGG TTA AAA GTG GAG TAG TAT GAT AGT TGC GGT TAT TAG 
DRLKVEYYDSCGYY 

CDR3 

i j k 1 m n o 101 102 103 104 105 105 107 
GTT TCT CGG TTC GGT GOT TTT GAT ATC TGG GGC CAA GGG ACA 
V SRFGAFDI W G Q G T 



108 109 110 111 112 113 
ACG GTC ACC GTC TCA TCA 
T V T V S S 
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Figure 11 




dAb designation 
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Figure 12 



M2R2-1 (-Cys) binding to anti-FLAG M2 
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Figure 13 
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Figure 14 



Structure of Vh domain of hxoman A6 antibody. The 
mutated nucleotides spcuining residues 7-48 to remove 
the recombination site are in bold and underlined. 

1 2 3 4 5 6 7 8 .9 10 11 12 13 14 
GAG GTC CAA TTA GAG GAA AGT GGT GGC GGA CTG GTQ CAA CCA 

e Vqlqesggglvqp 

15 16 17 18 19 20 21 22 23 24 25 26 27 28 
GGA GGA TCC CTG AGA CTC TCC TGT TCA GCT AGC GGA TTC ACC 
GGSLRLSCSASGFT 



29 30 31 32 33 34 35 36 37 38 39 40 41 42 
TTC AGT AGC TAT GCT ATG CAC TGG GTC CGC CAG GCT CCA GGG 
F S S Y A M H W V R Q A P G 

CDRl 

43 44 45 46 47 48 49 50 51 52 a 53 54 55 
AAG GGA CTG GAA TAT GTT TCA GCT ATT AGT AGT AAT GGG GGT 
KG L . E Y V S AT S S N G G 

CDR2 

56 57 58 59 60 61 62 63 64 65 66 67 68 69 
AGC ACA TAC TAC GCA GAC TCC GTG AAG GGC AGA TTC ACC ATC 
gT YYA D SVKG R F T I 



70 71 72 73 74 75 76 77 78 79 80 81 82 a 
TCC AGA GAC AAT TCC AAG AAC ACT CTG TAT CTT CAA ATG AGC 
SRDNSKNTLY LQMS 



be 83 84 85 86 87 88 89 90 91 92 93 94 
AGT CTG AGA GCT GAG GAC ACG GCT GTG TAT TAC TGT GTG AAA 
S LRAE DTAVYYCVK 
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Figure 14 (contined) 



95 96 97 98 99 100 abcdefgh 
GAC AGG TTA i^^AA GTG GAG TAG TAT GAT AGT AGT GGT TAT TAG 

DRLKVEYYDSSGY Y 

CDR3 

i j k 1 m n o 101 102 103 104 105 105 107 
GTT TCT CGG TTC GGT GCT TTT GAT ATC TGG GGC CAA GGG ACA 
VSRFGAFDI W G Q G T 



108 109 110 
ACG GTC ACC 
T V T 



111 112 113i 
GTC TCA TCA 
V S S 
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Fjgiiire 15 
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SEQUENCE LISTING 

<110> Novopharm Biotech Inc. 

<120> ENHANCED PHAGE DISPLAY LIBRARIES AND METHODS FOR 
PRODUCING SAME 

<130> 33956-41 

<140> PCT 

<141> 2000-09-07 



<150> CA2282179 
<151> 1999-09-07 

<150> US60/163,546 
<151> 1999-11-04 



<160> 60 



<170> Patentin Ver. 2.1 

<210> 1 
<211> 396 
<212> DNA 
<213> human 



<400> 1 

gaggtccagc 

tcctgttcag 

ccagggaagg 

gcagactccg 

cttcaaatga 

ttaaaagtgg 

atctggggcc 



tgcaggagtc 
cctctggatt 
gactggaata 
tigaagggcag 
gcagtctgag 
agtactatga 
aagggacaac 



tgggggaggc 
caccttcagt 
tgtttcagct 
attcaccatc 
agctgaggac 
tagtagtggt 
ggtcaccgtc 



ttagtccagc 
agctatgcta 
attagtagta 
tccagagaca 
acggctgtgt 
tattacgttt 
tcatca 



ctggggggtc 
tgcactgggt 
atgggggtag 
attccaagaa 
attactgtigt: 
ctcggttcgg 



cctgagactc 60 
ccgccaggct 120 
cacatactac 180 
cactctgtat 240 
gaaagacagg 300 
tgcttttgat 360 
396 



<210> 2 

<211> 132 

<212> PRT 

<213> human 



<400> 2 

Glu Val Gin Leu 
1 

Ser Leu Arg Leu 
20 

Ala Met His Trp 
35 

Ser Ala lie Ser 
50 

Lys Gly Arg Phe 
65 



Gin Glu Ser Gly 
5 

Ser Cys Ser Ala 

Val Arg Gin Ala 
40 

Ser Asn Gly Gly 
55 

Thr lie Ser Arg 
70 



Gly Gly Leu Val 
10 

Ser Gly Phe Thr 
25 

Pro Gly Lys Gly 



Ser Thr Tyr Tyr 
60 

Asp Asn Ser Lys 
75 



Gin Pro Gly Gly 
15 

Phe Ser Ser Tyr 
30 

Leu Glu Tyr Val 
45 

Ala Asp Ser Val 



Asn Thr Leu Tyr 
80 



SUBSTITUTE SHEET (RULE 26) 



wo 01/18058 



PCT/CAOO/01027 



2/17 

Leu Gin Met Ser Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys 
85 90 95 

Val Lys Asp Arg Leu Lys Val Glu Tyr Tyr Asp Ser Ser Gly Tyr Tyr 
100 105 110 

Val Ser Arg Phe Gly Ala Phe Asp lie Trp Gly Gin Gly Thr Thr Val 
115 120 125 

Thr Val Ser Ser 
130 



<210> 3 
<211> 5 
<212> PRT 
<213> human 

<400> 3 

Ser Tyr Ala Met His 
1 5 



<210> 4 
<211> 16 
<212> PRT 
<213> human 

<400> 4 

Ala lie Ser Ser Asn Gly Gly Ser Thr Tyr Tyr Ala Asp Ser Val Lys 
15 10 15 



<210> 5 
<211> 23 
<212> PRT 
<213> human 

<400> 5 

Asp Arg Leu Lys Val Glu Tyr Tyr Asp Ser Ser Gly Tyr Tyr Val Ser 
15 10 15 

Arg Phe Gly Ala Phe Asp lie 
20 



<210> 6 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 6 

gccccagata tcaaaacnnt ttcacacagt aata 34 
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<210> 7 

<211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 

<400> 7 

tgttcagcta gcggattc 



<210> 8 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 8 

tgaggagacg gtgaccgttg tcccttggcc ccagatatca aa 



<210> 9 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 9 

catgaccaca gtgcacagga ggtccagctg caggagtc 



<210> 10 

<211> 18 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence : primer 
<400> 10 

tttcacacag taatacac 



<210> 11 
<211> 57 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Secjuence : primer 
<400> 11 

cgattctgcg gccgctgagg agacggtgac cgttgtccct tggccccaga tatcaaa 57 
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<210> 12 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 12 

gttgtccctt ggccccanac nntttcacac agtaata 37 



<210> 13 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sec[uence : primer 



<210> 14 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 14 

ctctcctgtg ctgcctctgg a 21 



<210> 15 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 



<210> 16 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence .-primer 
<400> 16 



<400> 13 

actttcttgt aattggacct cggcctgcgc 



30 



<400> 15 

tccagaggca gcacaggaga g 



21 
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cgcacagtaa tacacagccg tgtcctcagc tctcagactg ttcatctgaa gata 



<210> 17 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Secjuence -.primer 

<400> 17 

gtgtattact gtgcgaaaga cagg 



<210> 18 

<211> 21 ^ 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence i.primer 
<400> 18 

caattacaag ctagtggtgg c 



<210> 19, 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 19 

tatggatcct gaggagacgg tgacctgtgt cccttggcc 



<210> 20 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 20 

catgaccaca gtgcacagga ggtccaatta caagaaag 



<210> 21 
<211> 43 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
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<400> 21 

cccctggccc cagatatcaa aacnntttcg cacagtaata cac 



43 



<:210> 22 

<211> 54 

<212> DNA 

<213> Axtificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 

<400> 22 

cgattctgcg gccgctgagg agacggtgac ctgtgtccct tggccccaga tatc 54 



<210> 23 

<211> 24 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : primer 



<210> 24 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 24 

cgccagggtt ttcccagtca cgac 



<210> 25 
<211> 59 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 25 

gaggtccaat tacaagctag tggtggcgga ctggtgcaac cagaggttcc ctgagactc 5 9 



<210> 26 
<211> 60 
<212> DNA 

<213> Artificial Sequence 



<400> 23 

gcggataaca atttcacaca ggaa 



24 



<220> 
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<223> Description of Artificial Sequence : primer 
<400> 26 

atcgcagttg cactggctgg tttcgctacc gttgcggagg ccgaggtcca attacaagct 60 



<210> 27 
'211> 54 
^212 > DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : primer 
<400> 27 

tagagggtag aattcatgaa aaaaaccgct atcgcgatcg cagttgcact ggct 54 



<210> 28 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence -.primer 
<400> 28 

tatggatcct gaggagacgg tgaccgt 



<210> 29 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 29 

tatgaagaca ccaggccgag gtccagctgc aggag 35 



<210> 30 

<211> 66 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence : primer 
<400> 30 

agcctggcgg acccagtgca tagcatagct actgaaggtg aatccgctag ctgaacagga 60 
gagtct €6 



<210> 31 
<211> 22 
<212> DNA 
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<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : primer 



<400> 31 

ccagggtttt cccagtcacg ac 



22 



<210> 32 

<211> 51 

<212> DNA 

<213> Artificial Secjuence 
<220> 

<223> Description of Artificial Sequence : primer 

<400> 32 

tgggtccgcc aggctccagg gaaggaacgt gaaggtgttt cagctattag t 51 



<210> 33 

<211> 51 

<212> DNA 

<213> Artificial Sequence 



<210> 34 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 

<400> 34 

tgctgcacag taatacacag ccgt 24 



<210> 35 

<211> 35 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : primer 



<220> 

<223> Description of Artificial Sequence : primer 



<400> 33 

tgttcagcta gcggattcac cttcagtagc tattgtatgc actgggtccg c 



51 



<400> 35 

cgattctgcg gccgctgagg agacggtgac cgttg 



35 



<210> 36 
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<211> 396 
<212> DNA 

<213> human (modified) 
<400> 36 

gaggtccagc tgcaggagtc tgggggaggc ttagtccagc ctggggggtc cctgagactc 60 
tcctgttcag ctagcggatt caccttcagt agctatgcta tgcactgggt ccgccaggct 120 
ccagggaagg aacgtgaagg tgtttcagct attagtagta atgggggtag cacatactac 180 
gcagactccg tgaagggcag attcaccatc tccagagaca attccaagaa cactctgtat 240' 
cttcaaatga gcagtctgag agctgaggac acggctgtgt attactgtgc agcagacagg 300 
ttaaaagtgg agtactatga tagtagtggt tattacgttt ctcggttcgg tgcttttgat 360 
atctggggcc aagggacaac ggtcaccgtc tcatca 3 96 



<210> 37 
<211> 132 
<212> PRT 

<213> human (modified) 
<400> 37 

Glu Val Gin Leu Gin Glu Ser Gly Gly Gly Leu Val Gin Pro Gly Gly 
15 10 15 

Ser Leu Arg Leu Ser Cys Ser Ala Ser Gly Phe Thr Phe Ser Ser Tyr 
20 25 30 

Ala Met His Trp Val Arg Gin Ala Pro Gly Lys Glu Arg Glu Gly Val 
35 40 45 

Ser Ala lie Ser Ser Asn Gly Gly Ser Thr Tyr Tyr Ala Asp Ser Val 
50 55 60 

Lys Gly Arg Phe Thr lie Ser Arg Asp Asn Ser Lys Asn Thr Leu Tyr 
65 70 75 80 

Leu Gin Met Ser Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys 
85 90 95 

Ala Ala Asp Arg Leu Lys Val Glu Tyr Tyr Asp Ser Ser Gly Tyr Tyr 
100 105 110 

Val Ser Arg Phe Gly Ala Phe Asp lie Trp Gly Gin Gly Thr Thr Val 
115 120 125 

Thr Val Ser Ser 
130 



<210> 38 
<211> 396 
<212> DNA 

<213> human (modified) 
<400> 38 

gaggtccaat tacaggaaag tggtggcgga 
tcctgttcag cctctggatt caccttcagt 
ccagggaagg gactggaata tgtttcagct 
gcagactccg tgaagggcag attcaccatc 



ctggtgcaac caggaggatc cctgagactc 60 
agctatgcta tgcactgggt ccgccaggct 120 
attagtagta atgggggtag cacatactac 180 
tccagagaca attccaagaa cactctgtat 24 0 
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cttcaaatga gcagtctgag agctgaggac acggctgtgt attactgtgt gaaagacagg 3 00 
ttaaaagtgg agtactatga tagtagtggt tattacgttt ctcggttcgg tgcttttgat 360 
atccggggcc aagggacaac ggtcaccgtc tcatca 3 96 



<210> 39 
<211> 132 
<212> PRT 
<213> human 



(modified) 



<400> 39 

Glu Val Gin Leu Gin Glu Ser Gly Gly Gly Leu Val Gin Pro Gly Gly 
1 5 10 . 15 

Ser Leu Arg Leu Ser Cys Ser Ala Ser Gly Phe Thr Phe Ser Ser Tyr 
20 25 30 

Ala Met His Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Glu Tyr Val 
35 40 45 

Ser Ala lie Ser Ser Asn Gly Gly Ser Thr Tyr Tyr Ala Asp Ser Val 
50 55 60 

Lys Gly Arg Phe Thr lie Ser Arg Asp Asn Ser Lys Asn Thr Leu Tyr 
65 70 75 80 

Leu Gin Met Ser Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys 
85 90 95 



Val Lys Asp Arg Leu Lys Val Glu Tyr Tyr Asp Ser Ser Gly Tyr Tyr 
100 105 110 

Val Ser Arg Phe Gly Ala Phe Asp lie Trp Gly Gin Gly Thr Thr Val 
115 120 125 

Thr Val Ser Ser 
130 



<210> 40 
<211> 396 
<212> DNA 

<213> human (modified) 
<400> 40 

gaggtccaat tacaggaaag tggtggcgga 
tcctgttcag ctagcggatt caccttcagt 
ccagggaagg aacgtgaagg tgtttcagct 
gcagactccg tgaagggcag attcaccatc 
cttcaaatga gcagtctgag agctgaggac 
ttaaaagtgg agtactatga tagtagtggt 
atctggggcc aagggacaac ggtcaccgtc 



ctggtgcaac caggaggatc cctgagactc 60 
agctatgcta tgcactgggt ccgccaggct 120 
attagtagta atgggggtag cacatactac 180 
tccagagaca attccaagaa cactctgtat 240 
acggctgtgt attactgtgc agcagacagg 3 00 
tattacgttt ctcggttcgg tgcttttgat 360 
tcatca 396 



<210> 41 

<211> 132 

<212> PRT 

<213> human (modified) 
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<400> 41 

Glu Val Gin Leu Gin Glu Ser Gly Gly Gly Leu Val Gin Pro Gly Gly 
15 10 15 

Ser Leu Arg Leu Ser Cys Ser Ala Ser Gly Phe Thr Phe Ser Ser Tyr 
20 25 30 

Ala Met His Trp Val Arg Gin Ala Pro Gly Lys Glu Arg Glu Gly Val 
35 40 45 

Ser Ala lie Ser. Ser Asn Gly Gly Ser Thr Tyr Tyr Ala Asp. Ser Val 
50 55 60 

Lys Gly Arg Phe Thr lie Ser Arg Asp Asn Ser Lys Asn Thr Leu Tyr 
65 70 75 80 

Leu Gin Met Ser Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys 
85 90 95 

Ala Ala Asp Arg Leu Lys Val Glu Tyr Tyr Asp Ser Ser Gly Tyr Tyr 
100 105 110 

Val Ser Arg Phe Gly Ala Phe Asp He Trp Gly Gin Gly Thr Thr Val 
115 120 125 

Thr Val Ser Ser 
130 



<210> 42 
<211> 396 
<212> DNA 

<213> human (modified) 
<400> 42 

gaggtccagc tgcaggagtc tgggggaggc ttagtccagc ctggggggtc cctgagactc 60 
tcctgttcag cctctggatt caccttcagt agctattgta tgcactgggt ccgccaggct 120 
ccagggaagg aacgtgaagg tgtttcagct attagtagta atgggggtag cacatactac 180 
gcagactccg tgaagggcag attcaccatc tccagagaca attccaagaa cactctgtat 240 
cttcaaatga gcagtctgag agctgaggac acggctgtgt attactgtgc agcagacagg 3 00 
ttaaaagtgg agtactatga tagttgcggt tattacgttt ctcggttcgg tgcttttgat 360 
atctggggcc aagggacaac ggtcaccgtc tcatca 396 



<210> 43 
<211> 132 
<212> PRT 

<213> human (modified) 
<400> 43 

Glu Val Gin Leu Gin Glu Ser Gly Gly Gly Leu Val Gin Pro Gly Gly 
15 10 15 

Ser Leu Arg Leu Ser Cys Ser Ala Ser Gly Phe Thr Phe Ser Ser Tyr 
20 25 30 

Cys Met His Trp Val Arg Gin Ala Pro Gly Lys Glu Arg Glu Gly Val 
35 40 45 
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Ser Ala lie Ser Ser Asn Gly Gly 
50 55 

Lys Gly Arg Phe Thr lie Ser Arg 
65 70 

Leu Gin Met Ser Ser Leu Arg Ala 
85 

Ala Ala Asp Arg Leu Lys Val Glu 
100 

Val Ser Arg Phe Gly Ala Phe Asp 
115 120 



12/17 

Ser Thr Tyr Tyr Ala Asp Ser Val 
60 

Asp Asn Ser Lys Asn Thr Leu Tyr 
75 80 

Glu Asp Thr Ala Val Tyr Tyr Cys 
90 . 95 

Tyr Tyr Asp Ser Cys Gly Tyr Tyr 
105 110 

lie Trp Gly Gin Gly Thr Thr Val 
125 



Thr Val Ser Ser 
130 



<210> 44 
<211> 396 
<212> DNA 
<213> human 



(modified) 



<400> 44 

gaggtccaat tacaggaaag tggtggcgga ctggtgcaac caggaggatc cctgagactc 60 
tcctgttcag ctagcggatt caccttcagt agctatgcta tgcactgggt ccgccaggct 12 0 
ccagggaagg gactggaata tgtttcagct attagtagta atgggggtag cacatactac 180 
gcagactccg tgaagggcag attcaccatc tccagagaca attccaagaa cactctgtat 24 0 
cttcaaatga gcagtctgag agctgaggac acggctgtgt attactgtgt gaaagacagg 300 
ttaaaagtgg agtactatga tagtagtggt tattacgttt ctcggttcgg tgcttttgat 360 
atctggggcc aagggacaac ggtcaccgtc tea tea 396 



<210> 45 
<211> 132 
<212> PRT 

<213> human (modified) 



<400> 45 

Glu Val Gin Leu Gin Glu Ser Gly Gly Gly Leu Val Gin Pro Gly Gly 
15 10 15 

Ser Leu Arg Leu Ser Cys Ser Ala Ser Gly Phe Thr Phe Ser Ser Tyr 
20- 25 30 

Ala Met His Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Glu Tyr Val 
35 40 45 

Ser Ala lie Ser Ser Asn Gly Gly Ser Thr Tyr Tyr Ala Asp Ser Val 
50 55 60 

Lys Gly Arg Phe Thr lie Ser Arg Asp Asn Ser Lys Asn Thr Leu Tyr 
65 70 75 80 

Leu Gin Met Ser Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys 
85 90 95 
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Val Lys Asp Arg Leu Lys Val Glu 
100 

Val Ser Arg Phe Gly Ala Phe Asp 
115 120 



Tyr Tyr Asp Ser Ser Gly Tyr Tyr 
105 110 

lie Trp Gly Gin Gly Thr Thr Val 
125 



Thr Val Ser Ser 
130 



<210> 46 
<211> 23 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : recombinant 
A6 -derived peptide 

<400> 46 

Val Gin Tyr Gly Lys His Arg Arg Gly Ser Cys lie Glu Val His Pro 
1 5 10 15 . 

Glu Tyr Lys Asp Phe Asp lie 
20 



<210> 47 
<211> 23 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : recombinant 
A6 -derived peptide 

<400> 47 

Asn Pro Pro Lys Pro Gly Ala Gin Ala Arg Cys Val Thr Thr Val Lys 
15 10 15 

Asp Tyr Lys Glu Phe Asp lie 
20 



<210> 48 
<211> 23 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : recombinant 
A6 -derived peptide 

<400> 48 

Ala Ala lie Gin Thr Glu Thr Ala Arg Trp Cys Asp Arg His Pro Val 
15 10 15 

Ser Tyr Lys Met Phe Asp lie 
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20 



210> 49 
211> 23 
2iri> PRT 

213> Artificial Sequence 

!23> Description of Artificial Sequence : recombinant 
A6 -derived peptide 

In^Thr^Glu Thr Gin Pro Leu Tyr Asn Asp Cys He Leu Arg Gin Ala 
1 5 10 

Vly Tyr Lys Trp Phe Asp He 
20 



<210> 50 
<211> 23 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : recombinant 
A6 -derived peptide 

<400> 50 „ 
Met His Thr Leu Gin His Tyr Arg Asn Leu Cys Ser Tyr Gin 
1 5 10 

Asp Tyr Lys His Phe Asp He 
20 



<210> 51 
<211> 23 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : recombinant 
A6 -derived peptide 

<400> 51 rr^ r 

Glv Leu Ser Gly Ser Arg Pro Asn Glu Gin Cys Asp Tyr Lys 
1 5 10 

Asp His Val Gin Phe Asp He 
20 



<210> 52 

c211> 23 

<212> PRT 

<213> Artificial 



Sequence 
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<220> 

<223> Description of Artificial Sequence : recombinant 
A6>derived peptide 

<400> 52 

Leu Ser Gly Gin Asn Tyr Thr Lys Thr Arg Cys Leu Val Met Gin Asn 
1 S . 10 . 15 

Asp Tyr Lys Met Phe Asp lie 
20 



<210> 53 
<211> 23 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial SeG[uence : recombinant 
A6 -derived peptide 

<400> 53 

Thr Ala Glu Pro Ala Leu Ser Pro Gin Ala Cys Met Thr Lys Glu Arg 
15 10 15 



Gin Tyr Lys Asp Phe Asp lie 
20 



<210> 54 

<211> 23 

<212> PRT 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence : recombinant 
A6 -derived peptide 

<400> 54 

Glu Thr Tyr Met Tyr Thr Arg Gly Lys Tyr Cys Arg Ala Leu Ser Ala 
15 10 15 



Asp Tyr Lys Leu Phe Asp lie 
20 



<210> 55 

<211> 23 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : recombinant 
A6 -derived peptide 

<400> 55 

Glu Thr Tyr Met Tyr Thr Arg Gly Lys Tyr Cys Arg Ala Leu Ser Ala 
15 10 15 
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Asp Tyr Lys Leu Phe Asp lie 
20 



<210> 56 
<211> 23 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sec[uence : recombinant 
A6 -derived peptide 

<400> 56 

Gly Ser Gin Ala lie Lys Asn Leu Ser Glu Cys Leu Val Arg Ser Asp 
15 10 15 

Asp Tyr Lys Lys Phe Asp lie 
. 20 



<210> 57 
<211> 23 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : recombinant 
A6 -derived peptide 

<400> 57 

Gly Arg Tyr Phe Gin Ser Lys He Thr Ser Cys Glu Asn Asn Asp Arg 
15 10 15 

Asp Tyr Lys Leu Phe Asp He 
20 



<210> 58 
<211> 23 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : recombinant 
A6 -derived peptide 

<400> 58 

Val Gin Tyr Gly Lys His Arg Arg Gly Ser Ser He Glu Val His Pro 
15 10 15 

Glu Tyr Lys Asp Phe Asp He 
20 



<210> 59 
<211> 21 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 59 

gccaccacta gcttgtaatt g 



<210> 60 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 60 

caattacaag aaagtggtgg cggactggtg caaccaggag gatccctgag actc 
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